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BHC03 1 051 KSC/klu/NT/V2003-10-27 

METHODS AND COMPOSI TIONS FOR THE RESPONSE PREDICTION OF MATJQ. 
NANT NEOPLASIA TO TREATMENT 

TECHNICAL FIELD OF TTTF INVITATION 

The invention relates to methods and compositions for the prediction, diagnosis, prognosis, 
5 prevention and treatment of neoplastic disease. Of particular interest is the response prediction oi 

neoplastic lesions to various therapeutic regimens. Neoplastic disease is often caused by 
chromosomal rearrangements which lead to over- or underexpression of the rearranged genes. The 
invention discloses genes which are overexpressed in neoplastic tissue and are useful as diagnostic 
markers and targets for treatment. Methods are disclosed for predicting, diagnosing and 
1 0 prognosing as well as preventing and treating neoplastic disease. 

BACKGROUND OF THE INVENTION 

Chromosomal aberrations (amplifications, deletions, inversions, insertions, translocations and/or 
viral integrations) are of importance for the development of cancer and neoplastic lesions, as they 
account for deregulations of the respective regions. Amplifications of genomic regions have been 

15 described, in which genes of importance for growth characteristics, differentiation, invasiveness or 
resistance to therapeutic intervention are located. One of those regions with chromosomal 
aberrations is the region carrying the HER-2/neu gene which is amplified in breast cancer patients. 
In approximately 25% of breast cancer patients the HER-2/neu gene is overexpressed due to gene 
amplification. HER-2/neu overexpression correlates with a poor prognosis (relapse, overall 

20 survival, sensitivity to therapeutics). The importance of HER-2/neu for the prognosis of the 
disease progression has been described [Gusterson et aL, 1992, (1)]. Gene specific antibodies 
raised against HER-2/neu (Herceptin™) have been generated to treat the respective cancer 
patients. However, only about 50% of the patients benefit from the antibody treatment with 
Herceptin™, which is most often combined with chemotherapeutic regimen. The discrepancy of 

25 HER-2/neu positive tumors (overexpressing HER-2/neu to similar extent) with regard to 

responsiveness to therapeutic intervention suggest, that there might be additional factors or genes 
being involved in growth and apoptotic characteristics of the respective tumor tissues. There 
seems to be no monocausal relationship between overexpression of the growth factor receptor 
HER-2/neu and therapy outcome. In line with this the measurement of commonly used tumor 

30 markers such as estrogen receptor, progesterone receptor, p53 and Ki-67 do provide only very 
limited information on clinical outcome of specific therapeutic decisions. Therefore there is a great 
need for a more detailed diagnostic and prognostic classification of tumors to enable improved 
therapy decisions and prediction of survival of the patients. The present invention addresses the 
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need for additional markers by providing genes, which expression is deregulated in tumors and 
correlates with clinical outcome. One focus is the deregulation of genes present in specific 
chromosomal regions and their interaction in disease development and drug responsiveness. 

HER-2/neu and other markers for neoplastic disease are commonly assayed with diagnostic 
methods such as imjriunohistochernistry (IHC) (e.g. HercepTest™ from DAKO Inc.) and 
Fluorescence-In-Situ-Hybridization (FISH) (e.g. quantitative measurement of the HER-2/neu and 
Topoisomerase II alpha with a fluorescence-iw-s/ta-Hybridization kit from VYSIS). Additionally 
HER-2/neu can be. assayed by detecting HER-2/neu fragments in serum with an ELISA test 
(BAYER Corp.) or a with a quantitative PCR kit which compares the amount of HER-2/neu gene 
with the amount of a non-amplified control gene in order to detect HER-2/neu gene amplifications 
(ROCHE). These methods, however, exhibit multiple disadvantages with regard to sensitivity, 
specificity, technical and personnel efforts, costs, time consumption, inter-lab reproducibili^) 
These methods are also restricted with regard to measurement of multiple parameters within one 
patient sample ("multiplexing")* Usually only about 3 to 4 parameters (e.g. genes or gene 
products) can be detected per tissue slide. Therefore, there is a need to develop a fast and simple 
test to measure simultaneously multiple parameters in one sample. The present invention 
addresses the need for a fast and simple high-resolution method, that is able to detect multiple 
diagnostic and prognostic markers simultaneously. 

SUMMARY OF THE INVENTION 

The present invention is based on discovery that chromosomal alterations in cancer tissues can 
lead to changes in the expression of genes that are encoded by the altered chromosomal regions. 
Exemplary 43 human genes have been identified that are co-amplified in neoplastic lesions from 
breast cancer tissue resulting in altered expression of several of these genes (Tables 1 to 4). TnHe 
43 genes are differentially expressed in breast cancer states, relative to their expression in normal, 
or non-breast cancer states. The present invention relates to derivatives, fragments, analogues and 
homologues of these genes and uses or methods of using of the same. 

The present invention further relates to novel preventive, predictive, diagnostic, prognostic and 
therapeutic compositions and uses for malignant neoplasia and breast cancer in particular. 
Especially membrane bound marker gene products containing extracellular domains can be a 
particularly useful target for treatment methods as well as diagnostic and clinical monitoring 
methods. 
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It is a discovery of the present invention that several of these genes are characterized in that 
gene products functionally interact in signaling cascades or by directly or indirectly influer 
each other. This interaction is important for the normal physiology of certain non-neopl 
tissues (e.g. brain or neurogenic tissue). The deregulation of these genes in neoplastic les 
5 where they are normally exhibit of different level of activity or are not active, however, resul 
pathophysiology and affects the characteristics of the disease-associated tissue. 

The present invention further relates to methods for detecting these deregulations in malig 
neoplasia on DNA and mRNA level. 

The present invention further relates to a method for the detection of chromosomal altera! 
characterized in that the relative abundance of individual mRNAs, encoded by genes, locate 
altered chromosomal regions is detected. 

The present invention further relates to a method for the detection of the flanking breakpoin 
named chromosomal alterations by measurement of DNA copy number by quantitative PC 
DNA-Arrays and DNA sequencing. 

A method for the prediction, diagnosis or prognosis of malignant neoplasia by the detectio 
DNA sequences flanking named genomic breakpoint or are located within such. 

The present invention further relates to a method for the detection of chromosomal alterat 
characterized in that the copy number of one or more genomic nucleic acid sequences loc 
within an altered chromosomal region(s) is detected by quantitative PCR techniques 
TaqMan™, Lightcycler™ and iCycler™). 

The present invention further relates to a method for the prediction, diagnosis or prognosi 
malignant neoplasia by the detection of at least 2 markers whereby the markers are genes 
fragments thereof or genomic nucleic acid sequences that are located on one chromosomal re 
which is altered in malignant neoplasia and breast cancer in particular. 

25 The present invention also discloses a method for the prediction, diagnosis or prognosi 
malignant neoplasia by the detection of at least 2 markers whereby the markers are located on 
or more chromosomal region(s) which is/are altered in malignant neoplasia; and the mar 
interact as (i) receptor and ligand or (ii) members of the same signal transduction pathwa 
(iii)members of synergistic signal transduction pathways or (iv) members of antagonistic si 

30 transduction pathways or (v) transcription factor and transcription factor binding site. 
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Also dislcosed is a method for the prediction, diagnosis or prognosis of malignant neoplasia by the 
detection of at least one marker whereby the marker is a VNTR, SNP, RFLP or STS which is 
located on one chromosomal region which is altered in malignant neoplasia due to amplification 
and the marker is detected in (a) a cancerous and (b) a non cancerous tissue or biological sample 
5 from the same individual. A preferred embodiment is the detection of at least one VNTR marker 
of Table 6 or at least on SNP marker of Table 4 or combinations thereof. Even more preferred can 
the detection, quantification and sizing of such polymorphic markers be achieved by methods of 
(a) for the comparative measurement of amount and size by PCR amplification and subsequent 
capillary electrophoresis, (b) for sequence determination and allelic discrimination by gel electro- 

10 phoresis (e.g. SSCP, DGGE), real time kinetic PCR, direct DNA sequencing, pyro-sequencing, 
mass-specific allelic discrimination or resequencing by DNA array technologies, (c) for the 
dertermination of specific restriction patterns and subsequent electrophoretic separation and (d) for 
allelic discrimination by allel specific PCR (e.g. ASO). An even more favorable detection of^^ 
hetrozygous VNTR, SNP, RFLP or STS is done in a multiplex fashion, utilizing a variety of 

15 labeled primers (e.g. fluorescent, radioactive, bioactive) and a suitable capillary electrophoresis 
(CE) detection system. 

In another embodiment the expression of these genes can be detected with DNA-arrays as 
described in W09727317 and US6379895. 

In a further embodiment the expression of these genes can be detected with bead based direct 
20 flourescent readout techniques such as described in W097 14028 and WO9952708. 

In one embodiment, the invention pertains to a method of determining the phenotype of a cell or 
tissue, comprising detecting the differential expression, relative to a normal or untreated cell, of at 
least one polynucleotide comprising SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19 or 21 to 26 or 5^1 
75, wherein the polynucleotide is differentially expressed by at least about 1.5 fold, at least about 2 
25 fold or at least about 3 fold. 

In a further aspect the invention pertains to a method of determining the phenotype of a cell or 
tissue, comprising detecting the differential expression, relative to a normal or untreated cell, of at 
least one polynucleotide which hybridizes under stringent conditions to one of the polynucleotides 
of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19 or 21 to 26 or 53 to 75 and encodes a polypeptide 
30 exhibiting the same biological function as given in Table 2 or 3 for the respective polynucleotide, 
wherein the polynucleotide is differentially expressed by at least at least about 1 .5 fold , at least 
about 2 fold or at least about 3 fold. 
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In another embodiment of the invention a polynucleotide comprising a polynucleotide select 
from SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19 or 21 to 26 and 53 to 75 or encoding one oft 
polypeptides with SEQ ID NO: 28 to 32, 34, 35, 37 to 42, 44, 45 or 47 to 52 or 76 to 98 can 
used to identify cells or tissue in individuals which exhibit a phenotype predisposed to bre* 
cancer or a diseased phenotype, thereby (a) predicting whether an individual is at risk for tl 
development, or (b) diagnosing whether an individual is having, or (c) proposing the progress* 
or the outcome of the treatment malignant neoplasia and breast cancer in particular. 

In yet another embodiment the invention provides a method for identifying genomic regions whit 
are altered on the chromosomal level and encode genes that are linked by function and a. 
differentially expressed in malignant neoplasia and breast cancer in particular. 

In yet another embodiment the invention provides the genomic regions 17q21, 3p21 and 12ql3 fc 
use in prediction, diagnosis and prognosis as well as prevention and treatment of malignar 
neoplasia and breast cancer. In particular not only the intragenic regions, but also intergeni 
regions, pseudogenes or non-transcribed genes of said chromosomal regions can be used fo 
diagnostic, predictive, prognostic and preventive and therapeutic compositions and methods 
Therefore sequences of coding or non-coding regions as depicted in this invention are offered * 
way of illustration and not by way of limitation. As one aspect of this, genomic sequences h 
between the genomic sequences depicted can be used for similiar purposes. 

In yet another embodiment the invention provides methods of screening for agents which regulate 
the activity of a polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 
to 98 or encoded by a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 
26 and 53 to 75. A test compound is contacted with a polypeptide comprising a polypeptide 
selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a 
polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75. Binding of the test compound to 
the polypeptide is detected. A test compound which binds to the polypeptide is thereby identified 
as a potential therapeutic agent for the treatment of malignant neoplasia and more particularly 
breast cancer. 

In even another embodiment the invention provides another method of screening for agents which 
regulate the activity of a polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 
52 and 76 to 98 or encoded by a polynucleotide comprising a polynucleotide selected from SEQ 
ID NO: 1 to 26 and 53 to 75. A test compound is contacted with a polypeptide comprising a 
polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide 
comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75. A biological activity 
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mediated by the polypeptide is detected. A test compound which decreases the biological activity 
is thereby identified as a potential therapeutic agent for decreasing the activity of the polypeptide 
encoded by a polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 
98 or encoded by a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 
and 53 to 75 in malignant neoplasia and breast cancer in particular. A test compound which 
increases the biological activity is thereby identified as a potential therapeutic agent for increasing 
the activity of the polypeptide encoded by a polypeptide selected from one of the polypeptides 
with SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a 
polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 in malignant neoplasia and breast 
cancer in particular. 

In another embodiment the invention provides a method of screening for agents which regulate the 
activity of a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 50 
to 75. A test compound is contacted with a polynucleotide comprising a polynucleotide selected 
from SEQ ID NO: 1 to 26 and 53 to 75. Binding of the. test compound to the polynucleotide 
comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 is detected. A test 
compound which binds to the polynucleotide is thereby identified as a potential therapeutic agent 
for regulating the activity of a polynucleotide comprising a polynucleotide selected from SEQ ID 
NO: 1 to 26 and 53 to 75 in malignant neoplasia and breast cancer in particular. 

The invention thus provides polypeptides selected from one of the polypeptides with SEQ ID NO: 
27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a polynucleotide selected from 
SEQ ID NO: 1 to 26 and 53 to 75 which can be used to identify compounds which may act, for 
example, as regulators or modulators such as agonists and antagonists, partial agonists, inverse 
agonists, activators, co-activators and inhibitors of the polypeptide comprising a polypepti^ 
selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a 

- - - < -»/' 1 CI Ar»or>rri;nar1v the invention 

polynucleotide selected trom st-vj u-> i to iU au« -/-. «. * <=-■>• 

provides reagents and methods for regulating a polypeptide comprising a polypeptide selected 
from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a 
polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 in malignant neoplasia and more 
particularly breast cancer. The regulation can be an up- or down regulation. Reagents that 
modulate the expression, stability or amount of a polynucleotide comprising a polynucleotide 
selected from SEQ ED NO: 1 to 26 and 53 to 75 or the activity of the polypeptide comprising a 
polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide 
comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 can be a protein, a 
peptide, a peptidomimetic, a nucleic acid, a nucleic acid analogue (e.g. peptide nucleic acid, 
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locked nucleic acid) or a small molecule. Methods that modulate the expression, stability 
amount of a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and : 
to 75 or the activity of the polypeptide comprising a polypeptide selected from SEQ ID NO: 27 
52 and 76 to 98 or encoded by a polynucleotide comprising a polynucleotide selected from SE 
ID NO: 1 to 26 and 53 to 75 can be gene replacement therapies, antisense, ribozyme and triph 



nucleic acid approaches. 



In one embodiment of the invention provides antibodies which specifically bind to a full-length < 
partial polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 c 
encoded by a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 an 
53 to 75 or a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 an 
53 to 75 for use in prediction, prevention, diagnosis, prognosis and treatment of malignar 
neoplasia and breast cancer in particular. 

Yet another embodiment of the invention is the use of a reagent which specifically binds to : 
polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 or i 
polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or encode< 
by a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 7i 
in the preparation of a medicament for the treatment of malignant neoplasia and breast cancer ir 
particular. 

Still another embodiment is the use of a reagent that modulates the activity or stability of a 
polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded 
by a polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 
or the expression, amount or stability of a polynucleotide comprising a polynucleotide selected 
from SEQ ID NO: 1 to 26 and 53 to 75 in the preparation of a medicament for the treatment of 
malignant neoplasia and breast cancer in particular. 

Still another embodiment of the invention is a pharmaceutical composition which includes a 
reagent which specifically binds to a polynucleotide comprising a polynucleotide selected from 
SEQ ID NO: 1 to 26 and 53 to 75 or a polypeptide comprising a polypeptide selected from SEQ ID 
NO: 27 to 52 and 76 to 98 or encoded by a polynucleotide comprising a polynucleotide selected 
from SEQ ID NO: 1 to 26 and 53 to 75, and a pharmaceutical^ acceptable carrier. 

Yet another embodiment of the invention is a pharmaceutical composition including a 
polynucleotide comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 or 
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encoding a polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 
98. 

In one embodiment, a reagent which alters the level of expression in a cell of a polynucleotide 
comprising a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 or encoding a 
polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98, or a 
sequence complementary thereto, is identified by providing a cell, treating the cell with a test 
reagent, determining the level of expression in the cell of a polynucleotide comprising a 
polynucleotide selected from SEQ ED NO: 1 to 26 and 53 to 75 or encoding a polypeptide 
comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or a sequence comple- 
mentary thereto, and comparing the level of expression of the polynucleotide in the treated cell 
with the level of expression of the polynucleotide in an untreated cell, wherein a change in the 
level of expression of the polynucleotide in the treated cell relative to the level of expression of th£ 
polynucleotide in the untreated cell is indicative of an agent which alters the level of expression of 
- the polynucleotide in a cell. 

The invention further provides a pharmaceutical composition comprising a reagent identified by 
this method. 

Another embodiment of the invention is a pharmaceutical composition which includes a 
polypeptide comprising a polypeptide selected from SEQ ID NO: 27 to 52 and 76 to 98 or which is 
encoded by a polynucleotide comprising a polynucleotide selected from SEQ ED NO: 1 to 26 and 
53 to 75. 

A further embodiment of the invention is a pharmaceutical composition comprising a 
polynucleotide including a sequence which hybridizes under stringent conditions to^ 
polynucleotide comprising a polynucleotide selected from SEQ ED NO: I to 26 and 53 to 75 and 
encoding a polypeptide exhibiting the same biological function as given for the respective 
polynucleotide in Table 2 or 3, or encoding a polypeptide comprising a polypeptide selected from 
SEQ ED NO: 27 to 52 and 76 to 98. Pharmaceutical compositions, useful in the present invention 
may further include fusion proteins comprising a polypeptide comprising a polynucleotide selected 
from SEQ ID NO: 27 to 52 and 76 to 98, or a fragment thereof, antibodies, or antibody fragments 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a sketch of the chromosome 17 with G-banding pattern and cytogenetic positions. 
In the blow out at the lower part of the figure a detailed view of the chromosomal area of 
the long arm of chromosome 17 (17ql2-21.1) is provided. Each vertical rectangle depicted 
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in medium gray, represents a gene as labeled below or above the individual position- The 
order of genes depicted in this graph has been deduced from experiments questioning the 
amplification an over expression and from public available data (e.g. UCSC, NCBI or 
Ensemble). 

Fig. 2 shows the same region as depicted before in Fig. 1 and a cluster representation of the 
individual expression values measured by DNA-chip hybridization. The gene representing 
squares are indicated by a dotted line. In the upper part of the cluster representation 4 
tumor cell lines, of which two harbor a known HER-2/neu over expression (SKBR3 and 
AU565), are depicted with their individual expression profiles. Not only the HER-2/neu 
gene shows a clear over expression but as provided by this invention several other genes 
with in the surrounding. In the middle part of the cluster representation expression data 
obtained from immune histochemically characterized tumor samples are presented. Two of 
the depicted probes show a significant over expression of genes marked by the white 
rectangles. For additional information and comparison expression profiles of several non 
diseased human tissues (rnas obtained from Clontech Inc.) Are provided. Closest relation 
to the expression profile of HER-2/neu positive tumors displays human brain and neural 
tissue. 

Fig. 3 provides data from DNA amplification measurements by qpcr (e.g. Taqman). Data 
indicates that in several analyzed breast cancer cell lines harbor amplification of genes 
which were located in the previously described region (ARCHEON). Data were displayed 
for each gene on the x-axis and 40-Ct at the y-axis. Data were normalized to the 
expression level of GAPDH as seen in the first group of columns. 

Fig. 4 represents a graphical overview on the amplified regions and provides information on the 
length of the individual amplification and over expression in the analyzed tumor cell lines. 
The length of the amplification and the composition of genes has a significant impact on 
the nature of the cancer cell and on the responsiveness on certain drugs, as described 
elsewhere. 

DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

"Differential expression", as used herein, refers to both quantitative as well as qualitative 
differences in the genes' expression patterns depending on differential development and/or tumor 
growth. Differentially expressed genes may represent "marker genes," and/or "target genes". The 
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expression pattern of a differentially expressed gene disclosed herein may be utilized as part of a 
prognostic or diagnostic breast cancer evaluation. Alternatively, a differentially expressed gene 
disclosed herein may be used in methods for identifying reagents and compounds and uses of these 
reagents and compounds for the treatment of breast cancer as well as methods of treatment. 

5 "Biological activity" or "bioactivity" or "activity*' or "biological function'^ which are used 

interchangeably, herein mean an effector or antigenic function that is directly or indirectly 
performed by a polypeptide (whether in its native or denatured conformation), or by any fragment 
thereof in vivo or in vitro. Biological activities include but are not limited to binding to 
polypeptides, binding to other proteins or molecules, enzymatic activity, signal transduction, 
10 activity as a DNA binding protein, as a transcription regulator, ability to bind damaged DNA, etc. 
A bioactivity can be modulated by directly affecting the subject polypeptide. Alternatively, a 
bioactivity can be altered by modulating the level of the polypeptide, such as by modulatir^ 
expression of the corresponding gene. 

The term "marker" or "biomarker" refers a biological molecule, e.g., a nucleic acid, peptide, 
15 hormone, etc., whose presence or concentration can be detected and correlated with a known 
condition, such as a disease state. 

"Marker gene," as used herein, refers to a differentially expressed gene which expression pattern 
may be utilized as part of predictive, prognostic or diagnostic malignant neoplasia or breast cancer 
evaluation, or which, alternatively, may be used in methods for identifying compounds useful for 
20 the treatment or prevention of malignant neoplasia and breast cancer in particular. A marker gene 
may also have the characteristics of a target gene. 

"Target gene", as used herein, refers to a differentially expressed gene involved in breast cancei^^ 
a manner by which modulation of the level of target gene expression or of target gene product 
activity may act to ameliorate symptoms of malignant neoplasia and breast cancer in particular. A 
25 target gene may also have the characteristics of a marker gene. 

The term "biological sample", as used herein, refers to a sample obtained from an organism or 
from components (e.g., cells) of an organism. The sample may be of any biological tissue or fluid. 
Frequently the sample will be a "clinical sample" which is a sample derived from a patient. Such 
samples include, but are not limited to, sputum, blood, blood cells (e.g., white cells), tissue or fine 
30 needle biopsy samples, cell-containing bodyfluids, free floating nucleic acids, urine, peritoneal 
fluid, and pleural fluid, or cells therefrom. Biological samples may also include sections of tissues 
such as frozen sections taken for histological purposes. 
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By "array" or "matrix" is meant an arrangement of addressable locations or "addresses" on 
device. The locations can be arranged in two dimensional arrays, three dimensional arrays, 
other matrix formats. The number of locations can range from several to at least hundreds 
thousands. Most importantly, each location represents a totally independent reaction site. Am* 
include but are not limited to nucleic acid arrays, protein arrays and antibody arrays. A "nucle 
acid array- refers to an array containing nucleic acid probes, such as oligonucleotide 
polynucleotides or larger portions of genes. The nucleic acid on the array is preferably sing 
stranded. Arrays wherein the probes are oligonucleotides are referred to as "oligonucleotic 
arrays" or "oligonucleotide chips." A "microarray," herein also refers to a "biochip" or "biologic; 
chip", an array of regions having a density of discrete regions of at least about 100/cm 2 an 
preferably at least about lOOOW. The regions in a microarray have typical dimensions,' e.g 
ammeters, in the range of between about 10-250 M m, and are separated from other regions in th 
array by about the same distance. A "protein array" refers to an array containing polypeptid 
probes or protein probes which can be in native form or denatured. An "antibody array" refers t, 
an array containing antibodies which include but are not limited to monoclonal antibodies (e.g 
from a mouse), chimeric antibodies, humanized antibodies or phage antibodies and single chai, 
antibodies as well as fragments from antibodies. 

The term "agonist", as used herein, is meant to refer to an agent that mimics or upregulafes (e g 
potentiates or supplements) the bioactivity of a protein. An agonist can be a wild-type protein or 
denvative thereof having at least one bioactivity of the wild-type protein. An agonist can also be a 
compound that upregulates expression of a gene or which increases at least one bioactivity of a 
protein. An agonist can also be a compound which increases the interaction of a polypeptide with 
another molecule, e.g., a target peptide or nucleic acid. 

The term "antagonist" as used herein is meant to refer to an agent that downregulates (e g 
suppresses or inhibits) at least one bioactivity of a protein. An antagonist can be a compound 
which inhibits or decreases the interaction between a protein and another molecule, e.g., a target 
peptide, a ligand or an enzyme substrate. An antagonist can also be a compound that 
downregulates expression of a gene or which reduces the amount of expressed protein present. 

"Small molecule" as used herein, is meant to refer to a composition, which has a molecular weight 
of less than about 5 kD and most preferably less than about 4 kD. Small molecules can be nucleic 
acids, peptides, polypeptides, peptidomimetics, carbohydrates, lipids or other organic (carbon- 
containing) or inorganic molecules. Many pharmaceutical companies have extensive libraries of 
chemical and/or biological mixtures, often fungal, bacterial, or algal extracts, which can be 
screened with any of the assays of the invention to identify compounds that modulate a bioactivity 
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The terms "modulated" or "modulation" or "regulated" or "regulation" and "differentially 
regulated" as used herein refer to both upregulation (i.e., activation or stimulation (e.g., by 
agonizing or potentiating) and down regulation [i.e., inhibition or suppression (e.g., by 
antagonizing, decreasing or inhibiting)]. 

'Transcriptional regulator/ unit" refers to DMA sequences, such as initiation signals, enhancers, 
and promoters, which induce or control transcription of protein coding sequences with which they 
are operably linked. In preferred embodiments, transcription of one of the genes is under the 
control of a -promoter sequence (or other transcriptional regulatory sequence) which controls the 
expression of the recombinant gene in a cell-type in which expression is intended. It will also be 
understood that the recombinant gene can be under the control of transcriptional regulatory 
sequences which are the same or which are different from those sequences which control 
transcription of the naturally occurring forms of the polypeptide. ^ 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylate, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide 
from which it was derived. 

The term "nucleotide analog" refers to oligomers or polymers being at least in one feature 
different from naturally occurring nucleotides, oligonucleotides or polynucleotides, but exhibiting 
functional features of the respective naturally occurring nucleotides (e.g. base paring 
hybridization, coding information) and that can be used for said compositions. The nucleotiW 
analogs can consist of non-naturally occurring bases or polymer backbones, examples of which are 
LNAs, PNAs and Morpholinos. The nucleotide analog has at least one molecule different from its 
naturally occurring counterpart or equivalent. 

"BREAST CANCER GENES" or "BREAST CANCER GENE" as used herein refers to the 
polynucleotides of SEQ ID NO: 1 to 26 and 53 to 75, as well as derivatives, fragments, analogs 
and homologues thereof, the polypeptides encoded thereby, the polypeptides of SEQ ID NO: 27 to 
52 and 76 to 98 as well as derivatives, fragments, analogs and homologues thereof and the 
corresponding genomic transcription units which can be derived or identified with standard 
techniques well known in the art using the information disclosed in Tables 1 to 5 and Figures 1 to 
4. The GenBank, Locuslink ID and the UniGene accession numbers of the polynucleotide 
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sequences of the SEQ ID NO: 1 to 26 and 53 to 75 and the polypeptides of the SEQ ID NO: 2 
52 and 76 to 98 are shown in Table 1, the gene description, gene function and subceli 
localization is given in Tables 2 and 3. 

The term "chromosomal region" as used herein refers to a consecutive DNA stretch c 
chromosome which can be defined by cytogenetic or other genetic markers such as e.g. restric 
length polymorphisms (RFLPs), single nucleotide polymorphisms (SNPs), expressed sequ< 
tags (ESTs), sequence tagged sites (STSs), microsatellites, variable number of tandem rep 
(VNTRs) and genes. Typically a chromosomal region consists of up to 2 Megabases (MB), up 
MB, up to 6 MB, up to 8 MB, up to 10 MB, up to 20 MB or even more MB. 

The term "altered chromosomal region" or" abberant chromosomal region" refers to a struct 
change of the chromosomal composition and DNA sequence, which can occur by the follov 
events: amplifications, deletions, inversions, insertions, translocations and/or viral integration 
trisomy, where a given cell harbors more than two copies of a chromosome, is within the meai 
of the term "amplification" of a chromosome or chromosomal region. 

The present invention provides polynucleotide sequences and proteins encoded thereby, as we 
probes derived from the polynucleotide sequences, antibodies directed to the encoded proteins, 
predictive, preventive, diagnostic, prognostic and therapeutic uses for individuals which are at 
for or which have malignant neoplasia and breast cancer in particular. The sequences disclo: 
herein have been found to be differentially expressed in samples from breast cancer. 

The present invention is based on the identification of 43 genes that are differentially regul: 
(up- or downregulated) in tumor biopsies of patients with clinical evidence of breast cancer, 
identification of 43 human genes which were not known to be differentially regulated in br 
cancer states and their significance for the disease is described in the working examples her 
The characterization of the co-expression of these genes provides newly identified roles in br 
cancer. The gene names, the database accession numbers (GenBank and UniGene) as well as 
putative or known functions of the encoded proteins and their subcellular localization are give 
Tables 1 to 4. The primer sequences used for the gene amplification are shown in Table 5. 

In either situation, detecting expression of these genes in excess or in with lower levei 
compared to normal expression provides the basis for the diagnosis of malignant neoplasia 
breast cancer. Furthermore, in testing the efficacy of compounds during clinical trials, a decn 
in the level of the expression of these genes corresponds to a return from a disease condition 
normal state, and thereby indicates a positive effect of the compound. 
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Another aspect of the present invention is based on the observation that neighboring genes within 
defined genomic regions functionally interact and influence each others function directly or 
indirectly. A genomic region encoding functionally interacting genes that are co-amplified and co- 
expressed in neoplastic lesions has been defined as an "ARCHEON". (ARCHEON = Altered 
Region of Changed Chromosomal Expression Observed in Neoplasms). Chromosomal alterations 
often affect more than one gene. This is true for amplifications, duplications, insertions, 
integrations, inversions, translocations, and deletions. These changes can have influence on the 
expression level of single or multiple genes. Most commonly in the field of cancer diagnostics and 
treatment the changes of expression levels have been investigated for single, putative relevant 
target genes such as MLVI2 (5pl4), NRASL3 (6 P 12), EGFR (7pl2), c-myc (8q23), Cyclin Dl 
(llql3), IGF1R (15q25), HER-2/neu (17q21), PCNA (20ql2). However, the altered expression 
level an'd interaction of multiple (i.e. more than two) genes within one genomic region with each 
other has not been addressed. Genes of an ARCHEON form gene clusters with tissue specifiW 
expression patterns. The mode of interaction of individual genes within such a gene cluster 
suspected to represent an ARCHEON can be either protein-protein or protein-nucleic acm 
interaction, which may be illustrated but not limited by the following examples: ARCHEON gene 
interaction may be in the same signal transduction pathway, may be receptor to ligand binding, 
receptor kinase and SH2 or SH3 binding, transcription factor to promoter binding, nuclear 
hormone receptor to transcription factor binding, phosphogroup donation (e.g. kinases) and 
acceptance (e.g. phosphoprotein), mRNA stabilizing protein binding and transcriptional processes. 
The individual activity and specificity of a pair genes and or the proteins encoded thereby or of a 
group of such in a higher order, may be readily deduced from literature, published or deposited 
within public databases by the skilled person. However in the context of an ARCHEON the 
interaction of members being part of an ARCHEON will potentiate, exaggerate or reduce th|r 
singular functions. This interaction is of importance in defined normal tissues in which they W 

a thfQP clusters have been commonly conserved during 

normany cu-cApiwowu. ~- — , 

evolution. The aberrant expression of members of these ARCHEON in neoplastic lesions, 
however, (especially within tissues in which they are normally not expressed) has influence on 
tumor characteristics such as growth, invasiveness and drug responsiveness. Due to the interaction 
of these neighboring genes it is of important to determine the members of the ARCHEON which 
are involved in the deregulation events. In this regard amplification and deletion events .n 
neoplastic lesions are of special interest. 

The invention relates to a method for the detection of chromosomal alterations by (a) determining 
the relative mRNA abundance of individual mRNA species or (b) determining the copy number of 
one or more chromosomal region(s) by quantitative PCR. In one embodiment information on the 
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genomic organization and spatial regulation of chromosomal regions is assessed by bioinfom 
analysis of the sequence information of the human genome (UCSC, NCBI) and then comb 
with RNA expression data from GeneChip™ DNA-Arrays (Affymetrix) and/or quantitative : 
(TaqMan) from RNA-samples or genomic DNA. 

5 In a further embodiment the functional relationship of genes located on a chromosomal re 
which is altered (amplified or deleted) is established. The altered chromosomal region is del 
as an ARCHEON if genes located on that region functionally interact. 

The 17q21 locus was investigated as one model system, harboring the HER-2/neu gene, 
establishing a high-resolution assay to detect amplification events in neighboring genes, 43 g 
that are commonly co-amplified in breast cancer cell lines and patient samples were identified 
gene array technologies and immunological methods their co-overexpression in tumor sam 
was demonstrated. Surprisingly, by clustering tissue samples with HER-2/neu positive Tu 
samples, it was found that the expression pattern of this larger genomic region (consisting o 
genes) is very similar to control brain tissue. HER-2/neu negative breast tumor tissue did not s 
a similar expression pattern. Indeed, some of the genes within these cluster are important 
neural development (HER-2/neu, THRA) in mouse model systems or are described tc 
expressed in neural cells (NeuroD2). Moreover, by searching similar gene combinations in 
human and rodent genome additional homologous chromosomal regions on chromosome 3p21 
12ql3 harboring several isoforms of the respective genes (see below) were found. There w 
strong evidence for multiple interactions between the 43 candidate genes, as being part of idenl 
pathways (HER-2, neu, GRB7, CrkRS, CDC6), influencing the expression of each other (H 
2/neu, THRA, RARA), interacting with each other (PPARGBP, THRA, RARA, NR1D1 or H 
2/neu, GRB7) or expressed in defined tissues (CACNB1, PPARGBP, etc.). Interestingly, 
genomic regions of the ARCHEONs that were identified are amplified in acquired Tamox 
resistance of HER-2/neu negative cells (MCF7), which are normally sensitive to Tamox 
treatment [Achuthan et al., 2001,(2)]. 

Moreover, altered responsiveness to treatment due to the alterations of the genes within tl 
ARCHEONs was observed. Surprisingly, genes within the ARCHEONs are of importance eve 
the absence of HER-2/neu homologues. Some of the genes within the ARCHEONs, do not < 
30 serve as marker genes for prognostic purposes, but have already been known as targets 
therapeutic intervention. For example TOP2 alpha is a target of anthracyclins. THRA and KA 
can be targeted by hormones and hormone analogs (e.g. T3, rT3, RA). Due to their high affi 
binding sites and available screening assays (reporter assays based on their transcripts 
potential) the hormone receptors which are shown to be linked to neoplastic pathophysiology 
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the first time herein are ideal targets for drug screening and treatment of malignant neoplasia and 
breast cancer in particular. In this regard it is essential to know which members of the ARCHEON 
are altered in the neoplastic lesions. Particularly it is important to know the nature, number and 
extent to which the ARCHEON genes are amplified or deleted. The ARCHEONs are flanked by 
similar, endogenous retroviruses (e.g. HERV-K= "human endogenous retrovirus"), some of which 
are activated in breast cancer. These viruses may have also been involved in the evolution*,* 
duplication of the ARCHEONs. 

The analysis of the 17q21 region proved data obtained by fflC and identified several additional 
genes being co-amplified with the HER-2/neu gene. Comparative Analysis of RNA-based 
quantitative RT-PCR (TaqMan) with DNA-based qPCR from tumor cell lines identified the same 
amplified region. Genes at the 17ql 1 .2 -21. region are offered by way of illustration not by way of 
limitation. A graphical display of the described chromosomal region is provided in Figure 1. # 

mnlnvical rel<™<™™ of the e *r>*s which are part of the 1 7q2 1 ARCHEON 
MLN50 

By differential screening of cDNAs from breast cancer-derived metastatic axillary lymph nodes, 
TRAF4 and 3 other novel genes (MLN51, MLN62, MLN64) were identified that are 
overexpressed in breast cancer [Tomasetto et al., 1995, (3)]. One gene, which they designated 
MLN50, was mapped to 17qll-q21.3 by radioactive in situ hybridization. In breast cancer cell 
lines, overexpression of the 4 kb MLN50 mRNA was correlated with amplification of the gene and 
with amplification and overexpression of ERBB2, which maps to the same region. The authors 
suggested that the 2 genes belong to the same amplicon. Amplification of chromosomal region 
17ql l-q21 is one of the most common events occurring in human breast cancers. They report^ 
that the predicted 261-amino acid MLN50 protein contains an N-terminal LIM domain and a C- 
terminal SH3 domain. They renamed the protein LASP1, for -LIM and SH3 protein.' Northern blot 
analysis revealed that LASP1 mRNA was expressed at a basal level in all normal tissues examined 
and overexpressed in 8% of primary breast cancers. In most of these cancers, LASPl and ERBB2 
were simultaneously overexpressed. 

MLLT6 

The MLLT6 (AF17) gene encodes a protein of 1,093 amino acids, containing a leucine-zipper 
dimerization motif located 3-prime of the fusion point and a cysteine-rich domain at the end 
terminus. AF17 was found to contain stretches of amino acids previously associated with domains 
involved in transcriptional repression or activation. 
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Chromosome translocations involving band llq23 are associated with approximately 10 
patients with acute lymphoblastic leukemia (ALL) and more than 5% of patients with 
myeloid leukemia (AML). The gene at llq23 involved in the translocations is vari 
designated ALL1, HRX, MLL, and TRX1. The partner gene in one of the rarer translocai 
t(l I;17)(q23;q21), designated MLLT6 on 17ql2. 

ZNF144 (Mell8) 

Mel 18 cDNA encodes a novel cys-rich zinc finger motif. The gene is expressed strongly in 
tumor cell lines, but its normal tissue expression was limited to cells of neural origin and 
especially abundant in fetal neural cells. It belongs to a RING-finger motif family which inc 
BMI1. The MEL18/BMI1 gene family represents a mammalian homolog of the Droso 
'polycomb' gene group, thereby belonging to a memory mechanism involved in maintainin; 
the expression pattern of key regulatory factors such as Hox genes. Bmil, Mell8 and M33 g 
as representative examples of mouse Pc-G genes. Common phenotypes observed in knockout 
mutant for each of these genes indicate an important role for Pc-G genes not only in regulatii 
Hox gene expression and axial skeleton development but also in control of proliferation 
survival of haematopoietic cell lineages. This is in line with the observed prolifer 
deregulation observed in lymphoblastic leukemia. The MEL18 gene is conserved ar 
vertebrates. Its mRNA is expressed at high levels in placenta, lung, and kidney, and at lower \\ 
in liver, pancreas, and skeletal muscle. Interestingly, cervical and lumbo-sacral-HOX 
expression is altered in several primary breast cancers with respect to normal breast tissue wit 
HoxB gene cluster being present on 17q distal to the 17q21 locus. Moreover, dela; 
differentiation with persistent nests of proliferating cells was found in endothelial cells cocull 
with HOXB7-transduced SkBr3 cells, which exhibit a 17q21 amplification. Tumorigenicil 
these cells has been evaluated in vivo. Xenograft in athymic nude mice showed 
SkBr3/HOXB7 cells developed tumors with an increased number of blood vessels, e 
irradiated or not, whereas parental SkBr3 cells did not show any tumor take unless mice 
sublethally irradiated. As part of this invention, we have found MEL18 to be overexpre 
specifically in tumors bearing Her-2/neu gene amplification, which can be critical for 
expression. 

PHOSPHATIDYLINOSITOL-4-PHOSPHATE 5 -KINASE. T YPE IL BETA: PIP5K2B 

Phosphoinositide kinases play central roles in signal transduction. Phosphatidylinosit 
phosphate 5-kinases (PEP5Ks) phosphorylate phosphatidylinositol 4-phosphate, giving ris 
phosphatidylinositol 4,5-bisphosphate. The PDP5K enzymes exist as multiple isoforms that 
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various immunoreactivities, kinetic properties, and molecular masses. They are unique in that they 
possess almost no homology to the kinase motifs present in other phosphatidylinositol, protein, 
and lipid kinases. By screening a human fetal brain cDNA library with the PIP5K2B EST the full 
length gene could be isolated. The deduced 416-amino acid protein is 78% identical to PIP5K2A. 
5 Using SDS-PAGE, the authors estimated that bacterially expressed PIP5K2B has a molecular mass 
of 47 kD. Northern blot analysis detected a 6.3-kb PIP5K2B transcript which was abundantly 
expressed in several human tissues. PIP5K2B interacts specifically with the juxtamembrane region 
of the p55 TNF receptor (TNFR1) and PIP5K2B activity is increased in mammalian cells by 
treatment withf TNF-alpha. A modeled complex with membrane-bound substrate and ATP shows 

10 how a phosphoinositide kinase can phosphorylate its substrate in situ at the membrane 
interface.The substrate-binding site is open on 1 side, consistent with dual specificity for 
phosphatidylinositol 3- and 5-phosphates. Although the amino acid sequence of PEP5K2A does not 
show homology to known kinases, recombinant PIP5K2A exhibited kinase activity. PTP5K2^ 
contains a putative Src homology 3 (SH3) domain-binding sequence. Overexpression of mouse 

15 PIP5K1B in COS7 cells induced an increase in short actin fibers and.a decrease in actin stress 
fibers. 

TEM7 

Using serial analysis of gene expression (SAGE) a partial cDNAs corresponding to several tumor 
endothelial markers (TEMs) that displayed elevated expression during tumor angiogenesis could 

20 be identified. Among the genes identified was TEM7. Using database searches and 5-prime RACE 
the entire TEM7 coding region, which encodes a 500-amino acid type I transmembrane protein,has 
been described.. The extracellular region of TEM7 contains a plexin-like domain and has weak 
homology to the ECM protein nidogen. The function of these domains, which are usually found^p 
secreted and extracellular matrix molecules, is unknown. Nidogen itself belongs to the entactin 

25 protein family and helps to determine pathways of migrating axons by switching from 
circumferential to longitudinal migration. Entactin is involved in cell migration, as it promotes 
trophoblast outgrowth through a mechanism mediated by the RGD recognition site, and plays an 
important role during invasion of the endometrial basement membrane at implantation. As entactin 
promotes thymocyte adhesion but affects thymocyte migration only marginally, it is suggested that 

30 entactin may plays a role in thymocyte localization during T cell development. 

In situ hybridization analysis of human colorectal cancer demonstrated that TEM7 was expressed 
clearly in the endothelial cells of the tumor stroma but not in the endothelial cells of normal 
colonic tissue. Using in situ hybridization to assay expression in various normal adult mouse 
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tissues, they observed that TEM7 was largely undetectable in mouse tissues or tumors, but was 
abundantly expressed in mouse brain. 

ZNFNJA3 

By screening a B-cell cDNA library with a mouse Aiolos N-terminal cDNA probe, a cDNA 
5 encoding human Aiolos, or ZNFN1A3, was obtained. The deduced 509-amino acid protein, which 
is 86% identical to its mouse counterpart, has 4 DNA-binding zinc fingers in its N terminus and 2 
zinc fingers that mediate protein dimerization in its C terminus. These domains are 100% and 96% 
homologous to the corresponding domains in the mouse protein, respectively. Northern blot 
analysis revealed strong expression of a major 11.0- and a minor 4.4-kb ZNFN1A3 transcript in 
10 peripheral blood leukocytes, spleen, and thymus, with lower expression in liver, small intestine, 
and lung. 

Ikaros (ZNFN1A1), a hemopoietic zinc finger DNA-binding protein, is a central regulator of 
lymphoid differentiation and is implicated in leukemogenesis. The execution of normal function of 
Ikaros requires sequence-specific DNA binding, transactivation, and dimerization domains. Mice 

15 with a mutation in a related zinc finger protein, Aiolos, are prone to B-cell lymphoma. In 
chemically induced murine lymphomas allelic losses on markers surrounding the Znfhlal gene 
were detected in 27% of the tumors analyzed. Moreover specific Ikaros expression was in primary 
mouse hormone-producing anterior pituitary cells and substantial for Fibroblast growth factor 
receptor 4 (FGFR4) expression, which itself is implicated in a multitude of endocrine cell 

20 hormonal and proliferative properties with FGFR4 being differentially expressed in normal and 
neoplastic pituitary. Moreover Ikaros binds to chromatin remodelling complexes containing 
SWI/SNF proteins, which antagonize Polycomb function. Intetrestingly at the telomeric end of the 
disclosed ARCHEON the SWI/SNF complex member SMARCE1 (= SWI/SNF-related, matrix- 
associated, actin-dependent regulators of chromatin) is located and part of the described 

25 amplification. Due to the related binding specificities of Ikaros and Palindrom Binding Protein 
(PBP) it is suggestive, that ZNFN1 A3 is able to regulate the Her-2/neu enhancer. 

PPPIR1B 

Midbrain dopaminergic neurons play a critical role in multiple brain functions, and abnormal 
signaling through dopaminergic pathways has been implicated in several major neurologic and 
30 psychiatric disorders. One well-studied target for the actions of dopamine is DARPP32. In the 
densely dopamine- and glutamate-innervated rat caudate-putamen, DARPP32 is expressed in 
medium-sized spiny neurons that also express dopamine Dl receptors. The function of DARPP32 
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seems to be regulated by receptor stimulation. Both dopaminergic and glutamatergic (NMDA) 
receptor stimulation regulate the extent of DARPP32 phosphorylation, but in opposite directions. 

The human DARPP32 was isolated from a striatal cDNA library. The 204-amino acid DARPP32 
protein shares 88% and 85% sequence identity, respectively, with bovine and rat DARPP32 
5 proteins. The DARPP32 sequence is particularly conserved through the N terminus, which 
represents the active portion of the protein. Northern blot analysis demonstrated that the 2.1-kb 
DARPP32 mRNA is more highly expressed in human caudate than in cortex. In situ hybridization 
to postmortem human brain showed a low level of DARPP3i expression in all neocortical layers, 
with the strongest hybridization in the superficial layers. CDK5 phosphorylated DARPP32 in vitro 

10 and in intact brain cells. Phospho-thr75 DARPP32 inhibits PKA in vitro by a competitive 
mechanism. Decreasing phospho-thr75 DARPP32 in striatal cells either by a CDK5-specific 
inhibitor or by using genetically altered mice resulted in increased dopamine-induci^ 
phosphorylation of PKA substrates and augmented peak voltage-gated calcium currents. Thus, 
DARPP32 is a Afunctional signal transduction molecule which, by- distinct -mechanisms, controls a 

1 5 serine/threonine kinase and a serine/threonine phosphatase. 

DARPP32 and t-DARPP are overexpressed in gastric cancers. It's suggested that overexpression of 
these 2 proteins in gastric cancers may provide an important survival advantage to neoplastic cells. 
It could be demonstrated that Darpp32 is an obligate intermediate in progesterone-facilitated 
sexual receptivity in female rats and mice. The facilitative effect of progesterone on sexual 
20 receptivity in female rats was blocked by antisense oligonucleotides to Darpp32. Homozygous 
mice carrying a null mutation for the Darpp32 gene exhibited minimal levels of progesterone- 
facilitated sexual receptivity when compared to their wildtype littermates, and progesterone 
significantly increased hypothalamic cAMP levels and cAMP-dependent protein kinase activity^^ 

CACNB1 

25 In 1991 a cDNA clone encoding a protein with high homology to the beta subunit of the rabbit 
skeletal muscle dihydropyridine-sensitive calcium channel from a rat brain cDNA library [Pragnell 
et al., 1991, (4)]. This rat brain beta-subunit cDNA hybridized to a 3.4-kb message that was 
expressed in high levels in the cerebral hemispheres and hippocampus and much lower levels in 
cerebellum. The open reading frame encodes 597 amino acids with a predicted mass of 65,679 Da 

30 which is 82% homologous with the skeletal muscle beta subunit. The corresponding human beta- 
subunit gene was localized to chromosome 17 by analysis of somatic cell hybrids. The authors 
suggested that the encoded brain beta subunit, which has a primary structure highly similar to its 
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isoform in skeletal muscle, may have a comparable role as an integral regulatory component ol 
neuronal calcium channel. 



RPL19 



The ribosome is the only organelle conserved between prokaryotes and eukaryotes. In eukaryote 
this organelle consists of a 60S large subunit and a 40S small subunit. The mammalian ribosorr 
contains 4 species of RNA and approximately 80 different ribosomal proteins, most of whic 
appear to be present in equimolar amounts. In mammalian cells, ribosomal proteins can accou, 
for up to 15% of the total cellular protein, and the expression of the different ribosomal protei 
genes, which can account for up to 7 to 9% of the total cellular mRNAs, is coordinate* regulate 
to meet the cell's varying requirements for protein synthesis. The mammalian ribosomal protei 
genes are members of multigene families, most of which are composed of multiple processe, 
pseudogenes and a single functional intron-containing gene. The presence of multiple pseudogene 
hampered the isolation and study of the functional ribosomal protein genes. By study of somati, 
cell hybrids, it has been elucidated that DNA sequences complementary to 6 mammaliai 
nbosomal protein cDNAs could be assigned to chromosomes 5, 8, and 17. Ten fragments rappee 
to 3 chromosomes [Nakamichi et al., 1986, (5)]. These are probably a mixture of funcriona: 
(expressed) genes and pseudogenes. One that maps to 5q23-q33 rescues Chinese hamster emetine- 
resistance mutations in interspecies hybrids and is therefore the transcriptionally active RPS14 
gene. In 1989 a PCR-based strategy for the detection of intron-containing genes in the presence of 
mumple pseudogenes was described. This technique was used to identify the intron-containing 
PCR products of 7 human ribosomal protein genes and to map their chromosomal locations by 
hybndization to human/rodent somatic cell hybrids [Feo et al., 1992, (6)]. All 7 ribosomal protein 
genes were found to be on different chromosomes: RPL19 on 17pl2-ql 1;RPL30 on 8; RPL35A on 
18; RPL36A on 14; RPS6 on 9pter-pl3; RPS11 on 19cen-qter; and RPS17 on llpter-pl3. These 
are also different sites from the chromosomal location of previously mapped ribosomal protein 
genes S14 on chromosome 5, S4 on Xq and Yp, and RP1 17A on 9q3-q34. By fluorescence in situ 
hybruhzation the position of the RPL19 gene was mapped to 17ql 1 [Davies et al., 1989, (7)]. 

PPARBP, PHP QRSPt. CRSP200 TRIP2. TRAP?™ RB18A nRTP^n 

The thyroid hormone receptors (TRs) are hormone-dependent transcription factors that regulate 
expression of a variety of specific target genes. They must specifically interact with a number of 
proteins as they progress from their initial translation and nuclear translocation to 
heterodimerization with retinoid X receptors (RXRs), functional interactions with other 
transcription factors and the basic transcriptional apparatus, and eventually, degradation. To help 
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elucidate the mechanisms that underlie the transcriptional effects and other potential functions of 
TRs, the yeast interaction trap, a version of the yeast 2-hybrid system, was used to identify 
proteins that specifically interact with the ligand-binding domain of rat TR-beta-1 (THRB) [Lee et 
al., 1995, (8)]. The authors isolated HeLa cell cDNAs encoding several different TR-interacting 
proteins (TRIPs), including TRIP2. TRIP2 interacted with rat Thrb only in the presence of thyroid 
hormone. It showed a ligand-independent interaction with RXR-alpha, but did not interact with the 
glucocorticoid receptor (NR3C1) under any condition. By immunoscreening a human B- 
lymphoma cell cDNA expression library with the anti- P 53 monoclonal antibody PAM801, 
PPARBP was identified, which was called RB18A for 'recognized by PAM80T monoclonal 
antibody' [Drone et al., 1997, (9)]. The predicted 1,566-amino acid RB18A protein contains 
several potential nuclear localization signals, 13 potential N-glycosylation sites, and a high 
number of potential phosphorylation sites. Despite sharing common antigenic determinants with^ 
p53, RB18A does not show significant nucleotide or amino acid sequence similarity with p53^ 
Whereas the calculated molecular mass of RB18A is 166 kD, the apparent mass of recombinant 
RBI 8 A was 205 kD by SDS-PAGE analysis, the authors demonstrated that RB18A shares 
functional properties with p53, including DNA binding, p53 binding, and self-oligomerization. 
Furthermore, RBI 8 A was able to activate the sequence-specific binding of p53 to DNA which 
was induced through an unstable interaction between both proteins. Northern blot analysis of 
human tissues detected an 8.5-kb RB18A transcript in all tissues examined except kidney, with 
highest expression in heart. Moreover mouse Pparbp, which was called Pbp for 'Ppar-binding 
protein,' as a protein that interacts with the Ppar-gamma (PPARG) ligand-binding domain in a 
yeast 2-hybrid system was identified [Zhu et al., 1997, (10)]. The authors found that Pbp also 
binds to PPAR-alpha (PPARA), RAR-alpha (RARA), RXR, and TR-beta-1 in vitro. The binding 
of Pbp to these receptors increased in the presence of specific ligands. Deletion of the last ^ 
amino acids from the C terminus of PPAR-gamma resulted in the abolition of interaction between 
n>p and PPAR-gamma. Fbp modestly increased the transcriptional activity of PPAR-gamma, and 
a truncated form of Pbp acted as a dominant-negative repressor, suggesting that Pbp is a genuine 
transcriptional co-activator for PPAR The predicted 1,560-amino acid Pbp protein contains 2 
LXXLL motifs, which are considered necessary and sufficient for the binding of several co- 
activators to nuclear receptors. Northern blot analysis detected Pbp expression in all mouse tissues 
examined, with higher levels in liver, kidney, lung, and testis. In situ hybridization showed that 
Pbp is expressed during mouse ontogeny, suggesting a possible role for Pbp in cellular 
proliferation and differentiation. In adult mouse, in situ hybridization detected Pbp expression in 
liver, bronchial epithelium in the lung, intestinal mucosa, kidney cortex, thymic cortex, splenic 
follicles, and seminiferous epithelium in testis. Lateron PPARBP was identified, which was called 



BHC03 1 051 



-23- 

TRAP220, from an immunopurified TR-alpha (THRA)-TRAP complex [Yuan et aL, 1998, (11)] 
The authors cloned Jurkat cell cDNAs encoding TRAP220. The predicted 1,581-amino acic 
TRAP220 protein contains LXXLL domains, which are found in other nuclear receptor-interacting 
proteins. TRAP220 is nearly identical to RB18A , with these proteins differing primarily by an 
extended N terminus on TRAP220. In the absence of TR-alpha, TRAP220 appears to reside in a 
single complex with other TRAPs. TRAP220 showed a direct ligand-dependent interaction with 
TR-alpha, which was mediated through the C terminus of TR-alpha and, at least in part, the 
LXXLL domains of TRAP220. TRAP220 also interacted with other nuclear receptors, including 
vitamin D receptor, RARA, RXRA, PPARA, PPARG, and estrogen receptor-alpha (ESR1; 
133430), in a ligand-dependent manner. TRAP220 moderately stimulated human TR-alpha- 
mediated transcription in transfected cells, whereas a fragment containing the LXXLL motifs 
acted as a dominant-negative inhibitor of nuclear receptor-mediated transcription both in 
transfected cells and in cell-free transcription systems. Further studies indicated that TRAP220 
plays a major role in anchoring other TRAPs to TR-alpha during the function of the TR-alpha- 
TRAP complex and that TRAP220 may be a global co-activator for the nuclear receptor 
superfamily. PBP, a nuclear receptor co-activator, interacts with estrogen receptor-alpha (ESR1) in 
the absence of estrogen. This interaction was enhanced in the presence of estrogen, but was 
reduced in the presence of the anti-estrogen Tamoxifen. Transfection of PBP into cultured cells 
resulted in enhancement of estrogen-dependent transcription, indicating that PBP serves as a co- 
activator in estrogen receptor signaling. To examine whether overexpression of PBP plays a role in 
breast cancer because of its co-activator function in estrogen receptor signaling, the levels of PBP 
expression in breast tumors was determined [Zhu et al., 1999, (12)]. High levels of PBP expression 
were detected in approximately 50% of primary breast cancers and breast cancer cell lines by 
ribonuclease protection analysis, in situ hybridization, and immunoperoxidase staining. By using 
FISH, the authors mapped the PBP gene to 17ql2, a region that is amplified in some breast 
cancers. They found PBP gene amplification in approximately 24% (6 of 25) of breast tumors and 
approximately 30% (2 of 6) of breast cancer cell lines, implying that PBP gene overexpression can 
occur independent of gene amplification. They determined that the PBP gene comprises 17 exons 
that together span more than 37 kb. Their findings, in particular PBP gene amplification, suggested 
that PBP, by its ability to function as an estrogen receptor-alpha co-activator, may play a role in 
mammary epithelial differentiation and in breast carcinogenesis. 

NEUROD2 



Basic helix-loop-helix (bHLH) proteins are transcription factors involved in determining cell type 
during development. In 1995 a bHLH protein was described, termed NeuroD (for 'neurogenic 
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differentiatioiV), that functions during neurogenesis [Lee et al., 1995, (13)]. The human NEUROD 
gene maps to chromosome 2q32. The cloning and characterization of 2 additional NEUROD 
genes, NEUROD2 and NEUROD3 was described in 1996 [McCormick et al. f 1996, (14)]. 
Sequences for the mouse and human homologues were presented. NEUROD2 shows a high degree 
of homology to the bHLH region of NEUROD, whereas NEUROD3 is more distantly related. The 
authors found that mouse neuroD2 was initially expressed at embryonic day 1 1, with persistent 
expression in the adult nervous system. Similar to neuroD, neuroD2 appears to mediate neuronal 
differentiation. The human NEUROD2 was mapped to 17ql2 by fluorescence in situ hybridization 
and the mouse homologue to chromosome 11 [Tamimi et al., 1997, (15)]. 



TELETHONIN 

Telethonin is a sarcomeric protein of 19 kD found exclusively in striated and cardiac muscle Ii 
appears to be localized to the Z disc of adult skeletal muscle and cultured myocytes. Telethonin i: 
a substrate of titin, which acts as a molecular 'ruler 1 for the assembly of the sarcomere by 
providing " spatially defined binding sites for other sarcomeric proteins. After activation by 
phosphorylation and calcium/calmodulin binding, titin phosphorylates the C-terminal domain of 
telethonin in early differentiating myocytes. The telethonin gene has been mapped to 17ql2, 
adjacent to the phenylethanolamine N-methyltransferase gene [Valle et al., 1997, (16)]. 

PENT. PNMT 

Phenylethanolamine N-methyltransferase catalyzes the synthesis of epinephrine from 
norepinephrine, the last step of catecholamine biosynthesis. The cDNA clone was first isolated in 
1998 for bovine adrenal medulla PNMT using mixed oligodeoxyribonucleotide probes whose 
synthesis was based on the partial amino acid sequence of tryptic peptides from the bovine enzy^^^ 
[Kaneda et al., 1988, (17)]. Using a bovine cDNA as a probe, the authors screened a human 
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contained a complete coding region of the enzyme. Northern blot analysis of human 
pheochromocytoma polyadenylated RNA using this cDNA insert as the probe demonstrated a 
single RNA species of about 1,000 nucleotides, suggesting that this clone is a full-length cDNA. 
The nucleotide sequence showed that human PNMT has 282 amino acid residues with a predicted 
molecular weight of 30,853, including the initial methionine. The amino acid sequence was 88% 
homologous to that of bovine enzyme. The PNMT gene was found to consist of 3 exons and 2 
introns spanning about 2,100 basepairs. It was demonstrated that in transgenic mice the gene is 
expressed in adrenal medulla and retina. A hybrid gene consisting of 2 kb of the PNMT 5-prime- 
flanking region fused to the simian virus 40 early region also resulted in tumor antigen mRNA 
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expression in adrenal glands and eyes; furthermore, immunocytochemistry showed that the tu 
antigen was localized in nuclei of adrenal medullary cells and cells of the inner nuclear cell I 
of the retina, both prominent sites of epinephrine synthesis. The results indicate that 
enhancer(s) for appropriate expression of the gene in these cell types are in the 2-kb 5-pri 
flanking region of the gene. 

Kaneda et ah, 1988 (17), assigned the human PNMT gene to chromosome 17 by Southern 
analysis of DNA from mouse-human somatic cell hybrids. In 1992 the localization was narro 
down to 17q21-q22 by linkage analysis using RFLPs related to the PNMT gene and several 
DNA markers [Hoehe et al., 1992, (18)]. The findings are of interest in light of the descriptior 
a genetic locus associated with blood pressure regulation in the stroke-prone spontaneo 
hypertensive rat (SHR-SP) on rat chromosome 10 in a conserved linkage synteny group cc 
sponding to human chromosome 17q22-q24. See essential hypertension . 

MGC9753 

This gene maps on chromosome 17, at 17ql2 according to RefSeq. It is expressed at very 1 
level. It is defined by cDNA clones and produces, by alternative splicing, 7 different transci 
can be obtained (SEQ ID NO.60 to 66 and 83 to 89 /Table 1), altogether encoding 7 diffe 
protein isoforms. Of specific interest is the putatively secreted isoform g, encoded by a mRN; 
2.55 kb. Ifs premessenger covers 16.94 kb on the genome. It has a very long 3' UTR. . the pro 
(226 aa, MW 24.6 kDa, pi 8.5) contains no Pfam motif. The MGC9753 gene produces, 
alternative splicing, 7 types of transcripts, predicted to encode 7 distinct proteins. It contain' 
confirmed introns, 10 of which are alternative. Comparison to the genome sequence shows tha 
introns follow the consensual [gt-ag] rule, 1 is atypical with good support [tg_cg]. The six n 
abundant isoforms are designated by a) to i) and code for proteins as follows: 

a) This mRNA is 3.03 kb long, its premessenger covers 16.95 kb on the genome. It has a * 
long 3' UTR. The protein (190 aa, MW 21.5 kDa, pi 7.2) contains no Pfam motif. : 
predicted to localise in the endoplasmic reticulum. 

c) This mRNA is 1.17 kb long, its premessenger covers 16.93 kb on the genome. It ma: 
incomplete at the N terminus. The protein (368 aa, MW 41.5 kDa, pi 7.3) contains 
Pfam motif. 

d) This mRNA is 3.17 kb long, its premessenger covers 16.94 kb on the genome. It has a ' 
long 3* UTR and 5*p UTR. . The protein (190 aa, MW 21.5 kDa, pi 7.2) contains no P 
motif. It is predicted to localise in the endoplasmic reticulum. 
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g) This mRNA is 2.55 kb long, its premessenger covers 1 6.94 kb on the genome. It has a very 
long 3' UTR. . The protein (226 aa, MW 24.6 kDa, pi 8.5) contains no Pfam motif. It is 
predicted to be secreted. 

h) This mRNA is 2.68 kb long, its premessenger covers 16.94 kb on the genome. It has a very 
long 3' UTR. . The protein (320 aa, MW 36.5 kDa, pi 6.8) contains no Pfam motif. It is 
predicted to localise in the endoplasmic reticulum. 

i) This mRNA is 2.34 kb long, its premessenger covers 16.94 kb on the genome. It may be 
incomplete at the N terminus. It has a very long 3' UTR. . The protein (217 aa, MW 24.4 
kDa, pi 5.9) contains no Pfam motif. 

The MCG9753 gene may be homologue to the CAB2 gene located on chromosome 17ql2. The 
CAB2, a human homologue of the yeast COS16 required for the repair of DNA double-stran» 
breaks was cloned. Autofluorescence analysis of cells transfected with its GFP fusion protein 
demonstrated that CAB2 translocates Into vesicles, suggesting that overexpression of CAB2 may., 
decrease intercellular Mn- 

(2 +) by accumulating it in the vesicles, in the same way as yeast. 
Her-2/neu. MtRRl NGL. TKR1 

The oncogene originally called NEU was derived from rat neuro/glioblastoma cell lines. It encodes 
a tumor antigen, P 185, which is serologically related to EGFR the epidermal growth factor 
receptor. EGFR maps to chromosome 7. Inl985 it was found, that the human homologue, which 
they designated NGL (to avoid confusion with neuraminidase, which is also symbolized NEU* 
maps to 17ql2-q22 by in situ hybridization and to 17q21-qter in somatic cell hybrids [Yang-Fe* 
*t al.. 1985. (19)]. Thus, the SRO is 17q21-q22. Moreover, inl985 a potential cell surface receptor 
of the tyrosine kinase gene family was identified and characterized by cloning the gene [Coussens 
et al., 1985, (20)]. Its primary sequence is very similar to that of the human epidermal growth 
factor receptor. Because of the seemingly close relationship to the human EGF receptor, the 
authors called the gene HER2. By Southern blot analysis of somatic cell hybrid DNA and by in 
situ hybridization, the gene was assigned to 17q21-q22. This chromosomal location of the gene is 
coincident with the NEU oncogene, which suggests that the 2 genes may in fact be the same; 
indeed, sequencing indicates that they are identical. Inl988 a correlation between overexpression 
of NEU protein and the large-cell, comedo growth type of ductal carcinoma was found [van de 
Vijver et al., 1988, (21)]. The authors found no correlation, however, with lymph-node status or 
tumor recurrence. The role of HER2/NEU in breast and ovarian cancer was described in 1989, 
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which together account for one-third of all cancers in women and approximately one-quartei 
cancer-related deaths in females [Slamon etal., 1989, (22)]. 

An ERBB-related gene that is distinct from the ERBB gene, called ERBB1 was found in 19 
ERBB2 was not amplified in vulva carcinoma cells with EGFR amplification and did not re 
with EOF receptor mRNA. About 30-fold amplification of ERBB2 was observed in a hurr 
adenocarcinoma of the salivary gland. By chromosome sorting combined with veloc 
sedimentation and Southern hybridization, the ERBB2 gene was assigned to chromosome 
[Fukushige et al.,1986, (23)]. By hybridization to sorted chromosomes and to metaphase sprea 
with a genomic probe, they mapped the ERBB2 locus to 17q21. This is the chromosome 
breakpoint in acute promyelocyte leukemia (APL). Furthermore, they observed amplification ai 
elevated expression of the ERBB2 gene in a gastric cancer cell line. Antibodies against a synthef 
peptide corresponding to 14 amino acid residues at the COOH-terminus of a protein deduced fro 
the ERBB2 nucleotide sequence were raised in 1986. With these antibodies, the ERBB2 ge. 
product from adenocarcinoma cells was precipitated and demonstrated to be a 185-k 
glycoprotein with tyrosine kinase activity. A cDNA probe for ERBB2 and by in situ hybridizatic 
to APL cells with a 15;17 chromosome translocation located the gene to the proximal side of fr 
breakpoint [Kaneko et al., 1987, (24)]. The authors suggested that both the gene and the breakpoi, 
are located in band 17 q 21.1 and, further, that the ERBB2 gene is involved in the development c 
leukemia. In 1987 experiments indicated that NEU and HER2 are both the same as ERBB2 [E 
Fiore et al., 1987, (25)]. The authors demonstrated that overexpression alone can converge gen 
for a normal growth factor receptor, namely, ERBB2, into an oncogene. The ERBB2 to 17ql l- q 2 
by in situ hybridization [Popescu et al., 1989, (26)]. By in situ hybridization to chromosome: 
denved from fibroblasts carrying a constitutional translocation between 15 and 17, they showe< 
that the ERBB2 gene was relocated to the derivative chromosome 15; the gene can thus b< 
localized to 17ql2-q21.32. By family linkage studies using multiple DNA markers in the 17ql 2 . 
q21 region the ERBB2 gene was placed on the genetic map of the region. 

mterleukin-6 is a cytokine that was initially recognized as a regulator of immune and 
mflammatory responses, but also regulates the growth of many tumor cells, including prostate 
cancer. Overexpression of ERBB2 and ERBB3 has been implicated in the neoplastic 
transformation of prostate cancer. Treatment of a prostate cancer cell line with IL6 induced 
tyrosine phosphorylation of ERBB2 and ERBB3, but not ERBB 1 /EGFR. The ERBB2 forms a 
complex with the gpl30 subunit of the 1L6 receptor in an IL6-dependent manner. This association 
was miportant because the inhibition of ERBB2 activity resulted in abrogation of IL6-induced 
MAPK activation. Thus, ERBB2 is a critical component of IL6 signaling through the MAP kinase 
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pathway [Qiu et al., 1998, (27)]. These findings showed how a cytokine receptor can diversify its 
signaling pathways by engaging with a growth factor receptor kinase. 

Overexpression of ERBB2 confers Taxol resistance in breast cancers. Overexpression of ERBB2 
inhibits Taxol-induced apoptosis [Yu et al., 1998, (28)]. Taxol activates CDC2 kinase in MDA- 
MB-435 breast cancer cells, leading to cell cycle arrest at the G2/M phase and, subsequently, 
apoptosis. A chemical inhibitor of CDC2 and a dominant-negative mutant of CDC2 blocked 
Taxol-induced apoptosis in these cells. Overexpression of ERBB2 in MDA-MB-435 cells by 
transfection transcriptionally upregulates CDKN1A which associates with CDC2, inhibits Taxol- 
mediated CDC2 activation, delays cell entrance to G2/M phase, and thereby inhibits Taxol- 
) induced apoptosis. In CDKN1 A antisense-transfected MDA-MB-435 cells or in p21-/- MEF cells, 
ERBB2 was unable to inhibit Taxol-induced apoptosis. Therefore, CDKN1A participates in the 
regulation of a G2/M checkpoint that contributes to resistance to Taxol-induced apoptosis # 
ERBB2-overexpressing breast cancer cells. 

A secreted protein of approximately 68 kD was described, designated herstatin, as the product of 
5 an alternative ERBB2 transcript that retains intron 8 [Doherty et al., 1999, (29)]. This alternative 
transcript specifies 340 residues identical to subdomains I and II from the extracellular domain of 
pl85ERBB2, followed by a unique C-terminal sequence of 79 amino acids encoded by intron 8. 
The recombinant product of the alternative transcript specifically bound to ERBB2-transfected 
cells and was chemically crosslinked to p!85ERBB2, whereas the intron-encoded sequence alone 
>0 also bound with high affinity to transfected cells and associated with pl85 solubilized from cell 
extracts. The herstatin mKNA was expressed in normal human fetal kidney and liver, but was at 
reduced levels relative to P 185ERBB2 mRNA in carcinoma cells that contained an amplified 
ERBB2 gene. Herstatin appears to be an inhibitor of pl85ERBB2, because it disrupts dim^ 
reduces tyrosine phosphorylation of P 185, and inhibits the anchorage-independent growth of 

25 transformed cells that overexpress fcKauz. inc xxi^ « 

overexpressed in 25 to 30% of breast cancers, increasing the aggressiveness of the tumor. Finally, 
it was found that a recombinant monoclonal antibody against HER2 increased the clinical benefit 
of first-line chemotherapy in metastatic breast cancer that overexpresses HER2 [Slamon et al., 
2001,(30)]. 

30 GRB7 

Growth factor receptor tyrosine kinases (GF-RTKs) are involved in activating the cell cycle. 
Several substrates of GF-RTKs contain Src-homology 2 (SH2) and SH3 domains. SH2 domain- 
containing proteins are a diverse group of molecules important in tyrosine kinase signaling. Using 
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the CORT (cloning of receptor targets) method to screen a high expression mouse library, the 
gene for murine Grb7, which encodes a protein of 535 amino acids, was isolated [Margolis et al., 
1992, (31)]. GRB7 is homologous to ras-GAP (ras-GTPase-activating protein). It contains an SH2 
domain and is highly expressed in liver and kidney. This gene defines the GRB7 family, whose 
members include the mouse gene GrblO and the human gene GRB14. 

A putative GRB7 signal transduction molecule and a GRB7V novel splice variant from an 
invasive human esophageal carcinoma was isolated [Tanaka et al., 1998, (32)]. Although both 
GRB7 isoforms shared homology with the Mig-10 cell migration gene of Caenorhabditis elegans, 
the GRB7V isoform lacked 88 basepairs in the C terminus; the resultant frameshift led to 
substitution of an SH2 domain with a short hydrophobic sequence. The wildtype GRB7 protein, 
but not the GRB7V isoform, was rapidly tyrosyl phosphorylated in response to EGF stimulation in 
esophageal carcinoma cells. Analysis of human esophageal tumor tissues and regional lymph 
nodes with metastases revealed that GRB7V was expressed in 40% of GRB7-positive esophageal 
carcinomas. GRB7V expression was enhanced after metastatic spread to lymph nodes as compared 
to the original tumor tissues. Transfection of an antisense GRB7 RNA expression construct 
lowered endogenous GRB7 protein levels and suppressed the invasive phenotype exhibited by 
esophageal carcinoma cells. These findings suggested that GRB7 isoforms are involved in cell 
invasion and metastatic progression of human esophageal carcinomas. By sequence analysis, The 
GRB7 gene was mapped to chromosome 17q21-q22, near the topoisomerase-2 gene [Dong et al., 
1997, (33)]. GRB-7 is amplified in concert with HER2 in several breast cancer cell lines and that 
GRB-7 is overexpressed in both cell lines and breast tumors. GRB-7, through its SH2 domain, 
binds tightly to HER2 such that a large fraction of the tyrosine phosphorylated HER2 in SKBR-3 
cells is bound to GRB-7 [Stein et al., 1994, (34)]. 

GCSF. CSF3 

Granulocyte colony-stimulating factor (or colony stimulating factor-3) specifically stimulates the 
proliferation and differentiation of the progenitor cells for granulocytes. The partial amino acid 
sequence of purified GCSF protein was determined, and by using oligonucleotides as probes, 
several GCSF cDNA clones were isolated from a human squamous carcinoma cell line cDNA 
library [Nagata et al., 1986, (35)]. Cloning of human GCSF cDNA shows that a single gene codes 
for a 177- or 180-amino acid mature protein of molecular weight 19,600. The authors found that 
the GCSF gene has 4 introns and that 2 different polypeptides are synthesized from the same gene 
by differential splicing of mRNA. The 2 polypeptides differ by the presence or absence of 3 amino 
acids. Expression studies indicate that both have authentic GCSF activity. A stimulatory activity 
from a glioblastoma multiform cell line being biologically and biochemically indistinguishable 
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from GCSF produced by a bladder cell line was found in 1987. By somatic cell hybridization and 
in situ chromosomal hybridization, the GCSF gene was mapped to 17qll in the region of the 
breakpoint in the 15; 17 translocation characteristic of acute promyelocyte leukemia [Le Beau et 
al., 1987, (36)]. Further studies indicated that the gene is proximal to the said breakpoint and that it 
remains on the rearranged chromosome 17. Southern blot analysis using both conventional and 
pulsed field gel electrophoresis showed no rearranged restriction fragments. By use of a full-length 
cDNA clone as a hybridization probe in human-mouse somatic cell hybrids and in flow-sorted 
human chromosomes, the gene for GCSF was mapped to 17q21-q22 lateron 

THRA. THRAL ERBA. EAR7. ERBA2. ERBA3 

Both human and mouse DNA have been demonstrated to have two distantly related classes of 
ERBA genes and that in the human genome multiple copies of one of the classes exist [Jansson^ 
al., 1983, (37)]. A cDNA was isolated derived from rat brain messenger RNA on the basis of 
homology to the human thyroid receptor gene [Thompson et al., 1987, (38)]. Expression of this 
cDNA produced a high-affinity binding protein for thyroid hormones. Messenger RNA from this 
gene was expressed in tissue-specific fashion, with highest levels in the central nervous system 
and no expression in the liver. An increasing body of evidence indicated the presence of multiple 
thyroid hormone receptors. The authors suggested that there may be as many as 5 different but 
related loci. Many of the clinical and physiologic studies suggested the existence of multiple 
receptors. For example, patients had been identified with familial thyroid hormone resistance in 
which peripheral response to thyroid hormones is lost or diminished while neuronal functions are 
maintained. Thyroidologists recognize a form of cretinism in which the nervous system is severely 
affected and another form in which the peripheral functions of thyroid hormone are more 
dramatically affected. 

The cDNA encoding a specific form of thyroid hormone receptor expressed in human liver, 
kidney, placenta, and brain was isolated [Nakai et al., 1988, (39)]. Identical clones were found in 
human placenta. The cDNA encodes a protein of 490 amino acids and molecular mass of 54,824. 
Designated thyroid hormone receptor type alpha-2 (THRA2), this protein is represented by 
mRNAs of different size in liver and kidney, which may represent tissue-specific processing of the 
primary transcript. 

The THRA gene contains 10 exons spanning 27 kb of DNA. The last 2 exons of the gene are 
alternatively spliced. A 5-kb THRA1 mRNA encodes a predicted 410-amino acid protein; a 2.7-kb 
THRA2 mRNA encodes a 490-amino acid protein. A third isoform, TR-alpha-3, is derived by 
alternative splicing. The proximal 39 amino acids of the TH-alpha-2 specific sequences are deleted 
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in TR-alpha-3. A second gene, THRB on chromosome 3, encodes 2 isoforms of TR-beta 1 
alternative splicing. Inl989the structure and function of the EAR1 and EAR7 genes w 
elucidated, both located on 17q21 [Miyajima et al., 1989, (40)]. The authors determined that o. 
of the exons in the EAR7 coding sequence overlaps an exon of EAR1, and that the 2 genes a 
transcribed from opposite DNA strands. In addition, the EAR7 mRNA generates 2 alternative 
spliced isoforms, referred to as EAR71 and EAR72, of which the EAR71 protein is the huma 
counterpart of the chicken c-erbA protein. 

The thyroid hormone receptors, beta, alpha-1, and alpha-2 3 mRNAs are expressed in all tissue 
examined and the relative amounts of the three mRNAs were roughly parallel. None of the 
mRNAs was abundant in liver, which is the major thyroid hormone-responsive organ. This led t 
the assumption that another thyroid hormone receptor may be present in liver. It was found tha 
ERBA, which potentiates ERBB, has an amino acid sequence different from that of other know, 
oncogene products and related to those of the carbonic anhydrases [Debuire et al., 1984 (41)] 
ERBA potentiates ERBB by blocking differentiation of erythroblasts at an immature stage 
Carbonic anhydrases participate in the transport of carbon dioxide in erythrocytes. In 1986 it wa< 
shown that the ERBA protein is a high-affinity receptor for thyroid hormone. The cDNA sequence 
md,cates a relationship to steroid-hormone receptors, and binding studies indicate that it is a 
receptor for thyroid hormones. It is located in the nucleus, where it binds to DNA and activates 
transcription. 

Maternal thyroid hormone is transferred to the fetus early in pregnancy and is postulated to 
regulate brain development. The ontogeny of TR isoforms and related splice variants in 9 first- 
trimester fetal brains by semi-quantitative RT-PCR analysis has been investigated. Expression of 
the TR-beta-1, TR-alpha-1, and TR-alpha-2 isoforms was detected from 8.1 weeks' gestation. An 
additional truncated species was detected with the TR-alpha-2 primer set, consistent with the TR- 
alpha-3 splice variant described in the rat. All TR-alpha-derived transcripts were coordinate* 
expressed and increased approximately 8-fold between 8.1 and 13.9 weeks' gestation. A more 
complex ontogenic pattern was observed for TR-beta-1, suggestive of a nadir between 8.4 and 12 0 
weeks' gestation. The authors concluded that these findings point to an important role for the TR- 
alpha-1 isoform in mediating maternal thyroid hormone action during first-trimester fetal brain 
development 

The identification of the several types of thyroid hormone receptor may explain the normal 
vanation in thyroid hormone responsiveness of various organs and the selective tissue 
abnormalities found in the thyroid hormone resistance syndromes. Members of sibships, who were 
resistant to thyroid hormone action, had retarded growth, congenital deafness, and abnormal 
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bones, but bad normal intellect and sexual maturation, as well as augmented cardiovascular 
activity. In this family abnormal T3 nuclear receptors in blood cells and fibroblasts have been 
demonstrated. The availability of cDNAs encoding the various thyroid hormone receptors was 
considered useful in determining the underlying genetic defect in this family. 

The ERBA oncogene has been assigned to chromosome 17. The ERBA locus remains on 
chromosome 17 in the t(l5;17) translocation of acute promyelocytic leukemia (APL). The 
thymidine kinase locus is probably translocated to chromosome 15; study of leukemia with 
t(17;21) and apparently identical breakpoint showed that TK was on 2iq+. By in situ hybridization 
of a cloned DNA probe of c-erb-A to meiotic pachytene spreads obtained from uncultured 
spermatocytes it has been concluded that ERBA is situated at 17q21.33-17q22, in the same region 
as the break that generated the t(15;17) seen in APL. Because most of the grains were seen in 
17q22, they suggested that ERBA is probably in the proximal region of 17q22 or at the junctidW 
between 17q22 and 17q21.33. By in situ hybridization it has been demonstrated, that that ERBA 
remains at 17ql l-ql2 in APL, whereas TP53, at 17q21-q22, is translocated to chromosome 15. 
Thus, ERBA must be at 17ql 1 .2 just proximal to the breakpoint in the APL translocation and just 
distal to it in the constitutional translocation. 

The aberrant THRA expression in nonfunctioning pituitary tumors has been hypothesized to 
reflect mutations in the receptor coding and regulatory sequences. They screened THRA mRNA 
and THRB response elements and ligand-binding domains for sequence anomalies. Screening 
THRA mRNA from 23 tumors by RNAse mismatch and sequencing candidate fragments 
identified 1 silent and 3 missense mutations, 2 in the common THRA region and 1 that was 
specific for the alpha-2 isoform. No THRB response element differences were detected in 14 
nonfunctioning tumors, and no THRB ligand-binding domain differences were detected m| 
nonfunctioning tumors. Therefore it has been suggested that the novel thyroid receptor mutations 
may be of functional significance in terms of thyroid receptor action, and furihei definition of their 
functional properties may provide insight into the role of thyroid receptors in growth control in 
pituitary cells. 

RAR-alpha 

A cDNA encoding a protein that binds retinoic acid with high affinity has been cloned rPetkovich 
et al., 1987, (42)]. The protein was found to be homologous to the receptors for steroid hormones, 
thyroid hormones, and vitamin D3, and appeared to be a retinoic acid-inducible transacting 
enhancer factor. Thus, the molecular mechanisms of the effect of vitamin A on embryonic 
development, differentiation and tumor cell growth may be similar to those described for other 
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members of this nuclear receptor family. In general, the DNA-binding domain is most high 
conserved, both within and between the 2 groups of receptors (steroid and thyroid); Using a cDN 
probe, the RAR-alpha gene has been mapped to 17q21 by in situ hybridization [Mattei et al., 198 
(43)]. Evidence has been presented for the existence of 2 retinoic acid receptors, RAR-alpha ar. 
RAR-beta, mapping to chromosome 17q21.1 and 3p24, respectively. The alpha and beta forms < 
RAR were found to be more homologous to the 2 closely related thyroid hormone receptors alph 
and beta, located on 17ql 1.2 and 3p25-p21, respectively, than to any other members of the nucle* 
receptor family. These observations suggest that the thyroid hormone and retinoic acid receptor 
evolved by gene, and possibly chromosome, duplications from a common ancestor, which itsel 
diverged rather early in evolution from the common ancestor of the steroid receptor group of th 
family. They noted that the counterparts of the human RARA and RARB genes are present in botl 
the mouse and chicken. The involvement of RARA at the APL breakpoint may explain why th. 
use of retinoic acid as a therapeutic differentiation agent in the treatment of acute myeloi. 
leukemias is limited to APL. Almost all patients with APL have a chromosomal translocate. 
t(15;17)(q22;q21). Molecular studies reveal that the translocation results in a chimeric gent 
through fusion between the PML gene on chromosome 15 and the RARA gene on chromosome 
17. A hormone-dependent interaction of the nuclear receptors RARA and RXRA with CLOCK 
and MOP4 has been presented. 

CDC 18 L. cnr.fi 

In yeasts, Cdc6 (Saccharomyces cerevisiae) and Cdclg (Schizosaccharomyces pombe) associate 
with the origin recognition complex (ORC) proteins to render cells competent for DNA 
replication. Thus, Cdc6 has a critical regulatory role in the initiation of DNA replication in yeast. 
cDNAs encoding Xenopus and human homologues of yeast CDC6 have been isolated [Williams et 
al., 1997, (44)]. They designated the human and Xenopus proteins P 62(cdc6). Independently, in a 
yeast 2-hybrid assay using PCNA as bait, cDNAs encoding the human CDC6/Cdcl8 homologue 
have been isolated [Saha et al, 1998, (45)]. These authors reported that the predicted 560-amino 
acid human protein shares approximately 33% sequence identity with the 2 yeast proteins. On 
Western blots of HeLa cell extracts, human CDC6/cdcl8 migrates as a 66-kD protein. Although 
Northern blots indicated that CDC6/Cdcl8 mRNA levels peak at the onset of S phase and diminish 
at the onset of mitosis in HeLa cells, the authors found that total CDC6/Cdcl8 protein level is 
unchanged throughout the cell cycle. Immunofluorescent analysis of epitope-tagged protein 
revealed that human CDC6/Cdcl8 is nuclear in Gl- and cytoplasmic in S-phase cells, suggesting 
that DNA replication may be regulated by either the translocation of this protein between the 
nucleus and cytoplasm or by selective degradation of the protein in the nucleus. 
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Immunoprecipitation studies showed that human CDC6/Cdcl8 associates in vivo with cyclin A, 
CDK2,and ORC1. The association of cyclin-CDK2 with CDC6/Cdcl8 was specifically inhibited 
by a factor present in mitotic cell extracts. Therefore it has been suggested that if the interaction 
between CDC6/Cdcl8 with the S phase-promoting factor cyclin-CDK2 is essential for the 
initiation of DNA replication, the mitotic inhibitor of this interaction could prevent a premature 
interaction until the appropriate time in Gl. Cdc6 is expressed selectively in proliferating but not 
quiescent mammalian cells, both in culture and within tissues in intact animals [Yan et al., 1998, 
(46)]. During the transition from a growth-arrested to a proliferative state, transcription of 
mammalian Cdc6 is regulated by E2F proteins, as revealed by a functional analysis of the human 
Cdc6 promoter and by the ability of exogenously expressed E2F proteins to stimulate the 
endogenous Cdc6 gene. Immunodepletion of Cdc6 by microinjection of anti-Cdc6 antibody 
blocked initiation of DNA replication in a human tumor cell line. The authors concluded that 
expression of human Cdc6 is regulated in response to mitogenic signals through transcriptional 
control mechanisms involving E2F proteins, and that Cdc6 is required for initiation of DNA 
replication in mammalian cells. 

Using a yeast 2-hybrid system, co-purification of recombinant proteins, and immunoprecipitation, 
it has been demonstrated lateron that an N-terminal segment of CDC6 binds specifically to PR48, 
a regulatory subunit of protein phosphatase 2A (PP2A). The authors hypothesized that 
dephosphorylation of CDC6 by PP2A, mediated by a specific interaction with PR48 or a related B- 
double prime protein, is a regulatory event controlling initiation of DNA replication in mammalian 
cells. By analysis of somatic cell hybrids and by fluorescence in situ hybridization the human 
p62(cdc6) gene has been to 17q21.3. 

TOP2A. TOP2. TOP2 alpha 0 

DNA tnnoisomerases are enzymes that control and alter the topologic states of DNA in both 
prokaryotes and eukaryotes. Topoisomerase H from eukaryotic cells catalyzes the relaxation of 
supercoiled DNA molecules, catenation, decatenation, knotting, and unknotting of circular DNA. 
It appears likely that the reaction catalyzed by topoisomerase H involves the crossing-over of 2 
DNA segments. It has been estimated that there are about 100,000 molecules of topoisomerase II 
per HeLa cell nucleus, constituting about 0.1% of the nuclear extract. Since several of the 
abnormal characteristics of ataxia-telangiectasia appear to be due to defects in DNA processing, 
screening for these enzyme activities in 5 AT cell lines has been performed [Singh et al., 1988, 
(47)]. In comparison to controls, the level of DNA topoisomerase H, determined by unknotting of 
P4 phage DNA, was reduced substantially in 4 of these cell lines and to a lesser extent in the fifth. 
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DNA topoisomerase I, assayed by relaxation of supercoil DNA, was found to be present at nor 
levels. 

The entire coding sequence of the human TOP2 gene has been determined [Tsai-Pflugfelder et 
1988, (48)]. 

5 In addition human cDNAs that had been isolated by screening a cDNA library derived fro? 
mechlorethamine-resistant Burkitt lymphoma cell line (Raji-HN2) with a Drosophila Top< 
cDNA had been sequenced [Chung et al., 1989, (49)]. The authors identified 2 classes of seque 
representing 2 TOP2 isoenzymes, which have been named TOP2A and TOP2B. The sequence < 
of the TOP2A cDNAs is identical to that of an internal fragment of the TOP2 cDNA isolated 

10 Tsai-Pflugfelder et al., 1988 (48). Southern blot analysis indicated that the TOP2A and TOl 
cDNAs are derived from distinct genes. Northern blot analysis using a TOP2A-specific pr 
detected a 6.5-kb transcript in the human cell line U937. Antibodies against a TOP2A pep 
recognized a 170-kD protein in U937 cell lysates. Therefore it was concluded that their c 
provide genetic and immunochemical evidence for 2 TOP2 isozymes. The complete structure! 

15 the TOP2A and TOP2B genes has been reported [Lang et al., 1998, (50)]. The TOP2A gene sp 
approximately 30 kb and contains 35 exons. 

Tsai-Pflugfelder et al., 1988 (48) showed that the human enzyme is encoded by a single-copy g 
which they mapped to 17q21-q22 by a combination of in situ hybridization of a cloned fragmen 
metaphase chromosomes and by Southern hybridization analysis with a panel of mouse : hur 

20 hybrid cell lines. The assignment to chromosome 17 has been confirmed by the study of som 
cell hybrids. Because of co-amplification in an adenocarcinoma cell line, it was concluded that 
TOP2A and ERBB2 genes may be closely linked on chromosome 17 [Keith et al., 1992, (5 
Using probes that detected RFLPs at both the TOP2A and TOP2B loci, the demonstn 
heterozygosity at a frequency of 0.17 and 0.37 for the alpha and beta loci, respectively. The mc 

25 homologue was mapped to chromosome 11 [Kingsmore et al., 1993, (52)]. The structure 
function of type II DNA topoisomerases has been reviewed [Watt et al., 1994, (53)]. D 
topoisomerase II-alpha is associated with the pol II holoenzyme and is a required componen 
chromatin-dependent co-activation. Specific inhibitors of topoisomerase II blocked transcrip- 
on chromatin templates, but did not affect transcription on naked templates. Addition of puri: 

30 topoisomerase II-alpha reconstituted chromatin-dependent activation activity in reactions \ 
core pol II. Therefore the transcription on chromatin templates seems to result in the accumula 
of superhelical tension, making the relaxation activity of topoisomerase II essential for produc 
RNA synthesis on nucleosomal DNA. 



BHC 03 1051 



-36 

IGFBP4 



Six structurally distinct insulin-like growth factor binding proteins have been isolated and their 
cDNAs cloned: IGFBP1, IGFBP2, IGFBP3, IGFBP4, IGFBP5 and IGFBP6. The proteins display 
strong sequence homologies, suggesting that they are encoded by a closely related family of genes. 
The IGFBPs contain 3 structurally distinct domains each comprising approximately one-third of 
the molecule. The N-terminal domain 1 and the C-terminal domain 3 of the 6 human IGFBPs show 
moderate to high levels of sequence identity including 12 and 6 invariant cysteine residues m 
domains 1 and 3, respectively (IGFBP6 contains 10 cysteine residues in domain 1), and are 
thought to be the IGF binding domains. Domain 2 is defined primarily by a lack of sequence 
identity among the 6 IGFBPs and by a lack of cysteine residues, though it does contain 2 cysteines 
in IGFBP4. Domain 3 is homologous to the thyroglobulin type I repeat unit. Recombinant human 
insulin-like growth factor binding proteins 4, 5, and 6 have been characterized by their express^ 
in yeast as fusion proteins with ubiquitin [Kiefer et al., 1992, (54)]. Results of the study suggested 
to the authors that the primary, effect of the 3 proteins.is the attenuation of IGF activity and 
suggested that they contribute to the control of IGF-mediated cell growth and metabolism. 
Moreover, IGFBPs have influence on EGFR and Her-2/neu mediated signaling. Addition of 
IGFBPs to Her-2/neu overexposing cells at least in part blocks growth and survival 
characteristica of the respective cells. 

Based on peptide sequences of a purified insulin-like growth factor-binding protein (IGFBP) rat 
IGFBP4 has been cloned by using PCR [Shimasaki et al., 1990, (55)]. They used the rat cDNA to 
clone the human ortholog from a liver cDNA library. Human IGFBP4 encodes a 258-amino acid 
polypeptide, which includes a 21-amino acid signal sequence. The protein is very hydrophihc, 
which may facilitate its ability as a carrier protein for the IGFs in blood. Northern blot analysing 
rat tissues revealed expression in all tissues examined, with highest expression in liver. It was 

--^t- •_. j j i m-ni;f»ratirm Thii penomic region 

stated that IGFBP4 acts as an lnmmtor oi iur-umu^ u r - 

containing the IGFBP gene. The gene consists of 4 exons spanning approximately 15 kb of 
genomic DNA has been examined [Zazzi et al., 1998, (56)]. The upstream region of the gene 
contains a TATA box and a cAMP-responsive promoter. 

By in situ hybridization, the IGFBP4 gene was mapped to 17ql2-q21 [Bajalica et al., 1992, (57)]. 
Because the hereditary breast-ovarian cancer gene BRCAl had been mapped to the same region, it 
has been investigated whether IGFBP4 is a candidate gene by linkage analysis of 22 BRCAl 
families; the finding of genetic recombination suggested that it is not the BRCAl gene [Tonin et 
al., 1993, (58)]. 
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EBI1. CC.R7. CMKBR7 

Using PCR with degenerate oligonucleotides, a lymphoid-specific member of the G proteii 
coupled receptor family has been identified and mapped mapped to 17ql2-q21.2 by analysis < 
human/mouse somatic cell hybrid DNAs and fluorescence in situ hybridization. It has been show 
that this receptor had been independently identified as the Epstein-Barr-induced cDNA (symbc 
EBI1) [Birkenbach et al., 1993, (59)]. EBI1 is expressed in normal lymphoid tissues and in seven 
B- and T-lymphocyte cell lines. While the function and the ligand for EBI1 remains unknown, il 
sequence and gene structure suggest that it is related to receptors that recognize chemoattractanti 
such as interleukin-8, RANTES, C5a, and fMet-Leu-Phe. Like the chemoattractant receptors, EBI 
contains intervening sequences near its 5-prime end; however, EBI1 is unique in that both of it 
introns interrupt the coding region of the first extracellular domain. Mouse Ebil cDNA has beei 
isolated and found to encode a protein with 86% identity to the human homologue. 

Subsets of murine CD4+ T cells localize to different areas of the spleen after adoptive transfer 
Naive and T helper-1 (TH1) cells, which express CCR7, home to the periarteriolar lymphok 
sheath, whereas activated TH2 cells, which lack CCR7, form rings at the periphery of the T-cel 
zones near B-cell follicles. It has been found that retroviral transduction of TH2 cells with CCR', 
forced them to localize in a THl-like pattern and inhibited their participation in B-cell help in vivc 
but not in vitro. Apparently differential expression of chemokine receptors results in unique 
cellular migration patterns that are important for effective immune responses. 

CCR7 expression divides human memory T cells into 2 functionally distinct subsets. CCR7- 
memory cells express receptors for migration to inflamed tissues and display immediate effector 
function. Li contrast, CCR7 + memory cells express lymph node homing receptors and lack 
immediate effector function, but efficiently stimulate dendritic cells and differentiate into CCR7" 
effector cells upon secondary stimulation. The CCR7+ and CCRT T cells, named central memory 
(T-CM) and effector memory (T-EM), differentiate in a step-wise fashion from naive T cells, 
persist for years after immunization, and allow a division of labor in the memory response. 

CCR7 expression in memory CD8 + T lymphocyte responses to HTV and to cytomegalovirus 
(CMV) tetramers has been evaluated. Most memory T lymphocytes express CD45RO, but a 
fraction express instead the CD45RA marker. Flow cytometric analyses of marker expression and 
cell division identified 4 subsets of HIV- and CMV-specific CD8 + T ceils, representing a lineage 
differentiation pattern: CD45RA + CCR7 + (double-positive); CD45RACCR7+; CD45RA'CCR7* 
(double-negative); CD45RA + CCRT. The capacity for cell division, as measured by 5-(and 
6-)carboxyl-fluorescein diacetate, succinimidyl ester, and intracellular staining for the Ki67 
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nuclear antigen, is largely confined to the CCR7 + subsets and occurred more rapidly in cells that 
are also CD45RA*. Although the double-negative cells did not divide or expand after stimulation, 
they did revert to positivity for either CD45RA or CCR7 or both. The CD45RA + CCRT cells, 
considered to be terminally differentiated, fail to divide, but do produce interferon-gamma and 
express high levels of perforin. The representation of subsets specific for CMV and for HIV is 
distinct. Approximately 70% of HIV-specific CD8 + memory T cells are double-negative or 
preterminally differentiated compared to 40% of CMV-specific cells. Approximately 50% of the 
CMV-specific CD8+ memory T cells are terminally differentiated compared to fewer than 10% of 
the HIV-specific cells. It has been proposed that terminally differentiated CMV-specific cells are 
poised to rapidly intervene, while double-positive precursor cells remain for expansion and 
replenishment of the effector cell pool. Furthermore, high-dose antigen tolerance and the depletion 
of HIV-specific CD4 + helper T-cell activity may keep the HIV-specific memory CD8 + T cells at 
the double-negative stage, unable to differentiate to the terminal effector state. B lymphocyte^ 
recirculate between B cell-rich compartments (follicles or B zones) in secondary lymphoid organs, 
surveying for antigen. After antigen binding; B cells move to the boundary of B and T zones to 
interact with T-helper cells. Furthermore it has been demonstrated that antigen-engaged B cells 
have increased expression of CCR7, the receptor for the T-zone chemokines CCL19 (also known 
as ELC) and CCL21, and that they exhibit increased responsiveness to both chemoattractants. In 
mice lacking lymphoid CCL19 and CCL21 chemokines, or with B cells that lack CCR7, antigen 
engagement fails to cause movement to the T zone. Using retroviral-mediated gene transfer, the 
authors demonstrated that increased expression of CCR7 is sufficient to direct B cells to the T 
zone. Reciprocally, overexpression of CXCR5, the receptor for the B-zone chemokine CXCL13, is 
sufficient to overcome antigen-induced B-cell movement to the T zone. This points toward a 
mechanism of B-cell relocalization in response to antigen, and established that cell position hi 
vivo can be determined by the balance of responsiveness to chemoattractants made in separate buT 

— 

RAFS7. SMARCE 1 

The SWI/SNF complex in S. cerevisiae and Drosophila is thought to facilitate transcriptional 
activation of specific genes by antagonizing chromatin-mediated transcriptional repression. The 
complex contains an ATP-dependent nucleosome disruption activity that can lead to enhanced 
binding of transcription factors. The BRGl/brm-associated factors, or BAF, complex in mammals 
is functionally related to SWI/SNF and consists of 9 to 12 subunits, some of which are 
homologous to SWI/SNF subunits. A 57-kD BAF subunit, BAF57, is present in higher eukaryotes, 
but not in yeast. Partial coding sequence has been obtained from purified BAF57 from extracts of 
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a human cell line [Wang et ah, 1998, (60)]. Based on the peptide sequences, they identified 
cDNAs encoding BAF57. The predicted 411-amino acid protein contains an HMG domain 
adjacent to a kinesin-like region. Both recombinant BAF57 and the whole BAF complex bind 4- 
way junction (4WJ) DNA, which is thought to mimic the topology of DNA as it enters or exits the 
nucleosome. The BAF57 DNA-binding activity has characteristics similar to those of other HMG 
proteins. It was found that complexes with mutations in the BAF57 HMG domain retain their 
DNA-binding and nucleosome-disruption activities. They suggested that the mechanism by which 
mammalian SWI/SNF-like complexes interact with chromatin may involve recognition of higher- 
order chromatin structure by 2 or more DNA-binding domains. RNase protection studies and 
Western blot analysis revealed that BAF57 is expressed ubiquitously. Several lines of evidence 
point toward the involvement of SWI/SNF factors in cancer development [Klochendler-Yeivin et 
al., 2002, (61)]. Moreover, SWI/SNF related genes are assigned to chromosomal regions that are 
frequently involved in somatic rearrangements in human cancers [Ring et al., 1998, (62)]. In this 
respect it is interesting that some of the SWI/SNF family members (i.e. SMARCC1, SMARCC2, 
SMARCD1 and SMARCD22 are neighboring 3 of the eucaryotic ARCHEONs we have identified 
(i.e. 3p21-p24, 12ql3-ql4 and 17q respectively)and which are part of the present invention. In this 
invention we could also map SMARCE1/BAF57 to the 17ql2 region by PCR karyotyping. 

KRT10.K10 

Keratin 10 is an intermediate filament (IF) chain which belongs to the acidic type I family and is 
expressed in terminally differentiated epidermal cells. Epithelial cells almost always co-express 
pairs of type I and type II keratins, and the pairs that are co-expressed are highly characteristic of a 
given epithelial tissue. For example, in human epidermis, 3 different pairs of keratins are 
expressed: keratins 5 (type II) and 14 (type I), characteristic of basal or proliferative cells; keratins 
1 (type II) and 10 (type I), characteristic of superbasal terminally differentiating cells; and keratins 
6 (type II) and 16 (type I) (and keratin 17 [type I]), characteristic of cells induced to hyper- 
proliferate by disease or injury, and epithelial cells grown in cell culture. The nucleotide sequence 
of a 1,700 bp cDNA encoding human epidermal keratin 10 (56.5 kD) [Darmon et al., 1987, (63)] 
has been published as well as the complete amino acid sequence of human keratin 10 [Zhou et al., 
1988, (64)]. Polymorphism of the KRT10 gene, restricted to insertions and deletions of the 
glycine-richquasipeptide repeats that form the glycine-loop motif in the C-terminal domain, have 
been extensively described [Korge et al., 1992, (65)]. 

By use of specific cDNA clones in conjunction with somatic cell hybrid analysis and in situ 
hybridization, KRT10 gene has been mapped to 17ql2-q21 in a region proximal to the breakpoint 
at 17q21 that is involved in a t(17;2i)(q21;q22) translocation associated with a form of acute 
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leukemia. KRT10 appeared to be telomeric to 3 other loci that map in the same region: CSF3, 
ERBA1, and HER2 [Lessin et al., 1988, (66)]. NGFR and HOX2 are distal to K9. It has been 
demonstrated that the KRT10, KRT13, and KRT15 genes are located in the same large pulsed field 
gel electrophoresis fragment [Romano et al., 1991, (67)]. A correlation of assignments of the 3 
genes makes 17q21-q22 the likely location of the cluster. Transgenic mice expressing a mutant 
keratin 10 gene have the phenotype of epidermolytic hyperkeratosis , thus suggesting that a 
genetic basis for the human disorder resides in mutations in genes encoding suprabasal keratins 
KRT1 or KRT10 [Fuchs et al 1992, (68)]. The authors also showed that stimulation of basal cell 
proliferation can result from a defect in suprabasal cells and that distortion of nuclear shape or 
alterations in cytokinesis can occur when an intermediate filament network is perturbed. In a 
family with keratosis palmaris et plantaris without blistering either spontaneously or in response to 
mild mechanical or thermal stress and with no involvement of the skin and parts of the body oth&L 
than the palms and soles, a tight linkage to an insertion-deletion polymorphism in the C-termimff 
coding region of the KRT10 gene (maximum lod score = 8.36 at theta = 0.00) was found [Rogaev 
et al., 1993, (69)]. It is noteworthy that it was a rare, high molecular weight allele of the KRT10 
polymorphism that segregated with the disorder. The allele was observed once in 96 independent 
chromosomes from unaffected Caucasians. The KRT10 polymorphism arose from the 
insertion/deletion of imperfect (CCG)n repeats within the coding region and gave rise to a variable 
glycine loop motif in the C-terminal tail of the keratin 10 protein. It is possible that there was a 
pathogenic role for the expansion of the imperfect trinucleotide repeat. 

KRT12.K12 

Keratins are a group of water-insoluble proteins that form 10 nm intermediate filaments in 
epithelial cells. Approximately 30 different keratin molecules have been identified. They can^^ 
divided into acidic and basic-neutral subfamilies according to their relative charges, 
immunoreaciivily, aiiu sequence Homologies to types I and H wool keratins, respectively. In vivo, 
a basic keratin usually is co-expressed and 'paired 1 with a particular acidic keratin to form a 
heterodimer. The expression of various keratin pairs is tissue specific, differentiation dependent, 
and developmental^ regulated. The presence of specific keratin pairs is essential for the 
maintenance of the integrity of epithelium. For example, mutations in human K14/K5 pair and the 
K10/K1 pair underlie the skin diseases, epidermolysis bullosa simplex and epidermolytic 
hyperkeratosis, respectively. Expression of the K3 and K12 keratin pair have been found in the 
cornea of a wide number of species, including human, mouse, and chicken, and is regarded as a 
marker for corneal -type epithelial differentiation. The murine Krtl2 (Kit LI 2) gene and 
demonstrated that its expression is corneal epithelial cell specific, differentiation dependent, and 
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developmental^ regulated [Liu et al., 1993, (70)]. The corneal-specific nature of keratin 12 ge 
expression signifies keratin 12 plays a unique role in maintaining normal corneal epitheli 
function. Nevertheless, the exact function of keratin 12 remains unknown and no hereditary hum 
corneal epithelial disorder has been linked directly to the mutation in the keratin 12 gene. As p* 
of a study of the expression profile of human corneal epithelial cells, a cDNA with an opc 
reading frame highly homologous to the cornea-specific mouse keratin 12 gene has been isolatt 
[Nishida et al., 1996, (71)]. To elucidate the function of keratin 12 knockout mice lacking tl 
Krtl.12 gene have been created by gene targeting techniques. The heterozygous mice appeare 
normal. Homozygous mice developed normally and suffered mild corneal epithelial erosion. Th 
corneal epithelia were fragile and could be removed by gentle rubbing of the eyes or brushing. Th 
corneal epithelium of the homozygotes did not express keratin 12 as judged b 
immunohistochemistry, Western immunoblot analysis with epitope-specific anti-keratin 1 
antibodies, Northern hybridization, and in situ hybridization with an antisense keratin 1: 
riboprobe. The KRT12 gene has been mapped to 17q by study of radiation hybrids and localized i 
to the type I keratin cluster in the interval between D17S800 and D17S930 (17ql2-q21) rNishid 
et al., 1997, (72)]. The authors presented the exon-intron boundary structure of the KRT12 gen« 
and mapped the gene to 17ql2 by fluorescence in situ hybridization. The gene contains 7 .introns 
defining 8 exons that cover the coding sequence. Together the exons and introns spar 
approximately 6 kb of genomic DNA. 

Meesmann corneal dystrophy is an autosomal dominant disorder causing fragility of the anterior 
corneal epithelium, where the cornea-specific keratins K3 and K12 are expressed. Dominant- 
negative mutations in these keratins might be the cause of Meesmann corneal dystrophy. Indeed, 
linkage of the disorder to the K12 locus in Meesmann's original German kindred [Meesmann and 
Wilke, 1939, (73)] with Z(max) = 7.53 at theta = 0.0 has been found. In 2 pedigrees from Northern 
Ireland, they found that the disorder co-segregated with K12 in one pedigree and K3 in the other. 
Heterozygous missense mutations in K3 or in K12 (R135T, V143L.) in each family have been 
identified. All these mutations occurred in highly conserved keratin helix boundary motifs, where 
dominant mutations in other keratins have been found to compromise cytoskeletal function 
severely, leading to keratinocyte fragility. 

The regions of the human KRT12 gene have been sequenced to enable mutation detection for all 
exons using genomic DNA as a template [Corden et al., 2000, (74)]. The authors found that the 
human genomic sequence spans 5,919 bp and consists of 8 exons. A microsatellite dinucleotide 
repeat was identified within intron 3, which was highly polymorphic and which they developed for 
use in genotype analysis. In addition, 2 mutations in the helix initiation motif of K12 were found 



BHC03 1 051 



-42- 

in families with Meesmann corneal dystrophy. In an American kindred, a missense M129T 
mutation was found in the KRT12 gene. They stated that a total of 8 mutations in the KRT12 gene 
had been reported. 

Genetic interactions w ithin ARCHEONs 

Genes involved in genomic alterations (amplifications, insertions, translocations, deletions, etc.) 
exhibit changes in their expression pattern. Of particular interest are gene amplifications, which 
account for gene copy numbers >2 per cell or deletions accounting for gene copy numbers <2 per 
cell. Gene copy number and gene expression of the respective genes do not necessarily correlate. 
Transcriptional overexpression needs an intact transcriptional context, as determined by regulatory 
regions at the chromosomal locus (promotor, enhancer and silencer), and sufficient amounts of 
transcriptional regulators being present in effective combinations. This is especially true f^Q 
genomic regions, which expression is tightly regulated in specific tissues or during specific 
developmental stages. ARCHEONs are specified by gene clusters of more than two genes being 
directly neighboured or in chromosomal order, interspersed by a maximum of 10, preferably 7, 
more preferably 5 or at least 1 gene. The interspersed genes are also co-amplified but do not 
directly interact with the ARCHEON. Such an ARCHEON may spread over a chromosomal region 
of a maximum of 20, more preferably 10 or at least 6 Megabases. The nature of an ARCHEON is 
characterized by the simultaneous amplification and/or deletion and the correlating expression (i.e. 
upregulation or downregulation respectively) of the encompassed genes in a specific tissue, cell 
type, cellular or developmental state or time point. Such ARCHEONs are commonly conserved 
during evolution, as they play critical roles during cellular development. In case of these 
ARCHEONs whole gene clusters are overexpressed upon amplification as they harbor self- 
regulatory feedback loops, which stabilize gene expression and/or biological effector funct^ 
even in abnormal biological settings, or are regulated by very similar transcription factor 
combinations, reflecting their simultaneous tunction in specific tissues ai certain developmental 
stages. Therefore, the gene copy numbers correlates with the expression level especially for genes 
in gene clusters functioning as ARCHEONs. In case of abnormal gene expressions in neoplastic 
lesions it is of great importance to know whether the self-regulatory feedback loops have been 
conserved as they determine the biological activity of the ARCHEON gene members. 

The intensive interaction between genes in ARCHEONs is described for the 17q2l ARCHEON 
(Fig. 1) by way of illustration not by limitation. In one embodiment the presence or absence of 
alterations of genes within distinct genomic regions are correlated with each other, as exemplified 
for breast cancer cell lines (Fig.3 and Fig. 4). This confers to the discovery of the present 
invention, that multiple interactions of said gene products of defined chromosomal localizations 
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happen, that according to their respective alterations in abnormal tissue have predictiv 
diagnostic, prognostic and/or preventive and therapeutic value. These interactions are media* 
directly or indirectly, due to the fact that the respective genes are part of interconnected < 
independent signaling networks or regulate cellular behavior (differentiation status, proliferate 
and /or apoptotic capacity, invasiveness, drug responsiveness, immune modulatory activities) in 
synergistic, antagonistic or independent fashion. The order of functionally important genes withi 
the ARCHEONs has been conserved during evolution (e.g. the ARCHEON on human chromosor 
17q21 is present on mouse chromosome 11). Moreover, it has been found that the 17q2 
ARCHEON is also present on human chromosome 3p21 and 12ql3, both of which are als. 
involved in amplification events and in tumor development. Most probably these homologou 
ARCHEONs were formed by duplications and rearrangements during vertebrate evolution 
Homologous ARCHEONs consist of homologous genes and/or isoforms of specific gene familie 
(e.g. RARA or RARB or RARG, THRA or THRB, TOP2A or TOP2B, RAB5A or RAB5B 
BAF1 70 or BAF 155, BAF60A or BAF60B, WNT5A or WNT5B, IGFBP4 or IGFBP6). Moreovei 
these regions are flanked by homologous chromosomal gene clusters (e.g. CACN, SCYA, HOX 
Keratins). These ARCHEONs have diverged during evolution to fulfill their respective functions 
in distinct tissues (e.g. the 17q21 ARCHEON has one of its main functions in the central nervous 
system). Due to their tissue specific function extensive regulatory loops control the expression of 
the members of each ARCHEON. During tumor development these regulations become critical for 
the characteristics of the abnormal tissues with respect to differentiation, proliferatibn, drug 
responsiveness, invasiveness. It has been found that the co-amplification of genes: within 
ARCHEONs can lead to co-expression of the respective gene products. Some of said genes also 
exhibit additional mutations or specific patterns of polymorphisms, which are substantial for the 
oncogenic capacities of these ARCHEONs. It is one of the critical features of such amplicons, 
which members of the ARCHEON have been conserved during tumor formation (e.g. during 
amplification and deletion events), thereby defining these genes as diagnostic marker genes. 
Moreover, the expression of the certain genes within the ARCHEON can be influenced by other 
members of the ARCHEON, thereby defining the regulatory and regulated genes as target genes 
for therapeutic intervention. It was also observed, that the expression of certain members of the 
ARCHEON is sensitive to drug treatment (e.g. TOP02 alpha, RARA, THRA, HER-2) which 
defines these genes as "marker genes". Moreover several other genes are suitable for therapeutic 
intervention by antibodies (CACNB1, EBI1), ljgands (CACNB1) or drugs like e.g. kinase 
inhibitors (CrkRS, CDC6). The following examples of interactions between members of 
ARCHEONs are offered by way of illustration, not by way of limitation. 
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EBI1/CCR7 is lymphoid-specific member of the G protein-coupled receptor family. EBI1 
recognizes chemoattractants, such as interleukin-8, SCYAs, Rantes, C5a, and fMet-Leu-Phe. The 
capacity for cell division is largely confined to the CCR7* subsets in lymphocytes. Double- 
negative cells did not divide or expand after stimulation. CCR7" cells, considered to be terminally 
differentiated, fail to divide, but do produce interferon-gamma and express high levels of perforin. 
EBI1 is induced by viral activities such as the Eppstein-Barr- Virus. Therefore, EBI1 is associated 
with transformation events in lymphocytes. A functional role of EBI1 during tumor formation in 
non-lymphoid tissues has been investigated in this invention. Interestingly, also ERBA and ERBB, 
located in the same genomic region, are associated with lymphocyte transformation. Moreover, 
ligands of the receptor (i.e. SCYA5/Rantes) are in genomic proximity on 17q. Abnormal 
expression of both of these factors in lymphoid and non-lymphoid tissues establishes an 
autorgulatory feedback loop, inducing signaling events within the respective cells. Expression of 
lymphoid factors has effect on immune cells and modulates cellular behavior. This is of particul™ 
interest with regard to abnormal breast tissue being infiltrated by lymphocytes. In line with this, 
another immunmodulatory and proliferation factor is located nearby x>n 17q21. Granulocyte - 
colony-stimulating factor (GCSF3) specifically stimulates the proliferation and differentiation of 
the progenitor cells for granulocytes. A stimulatory activity from a glioblastoma multiforme cell 
line being biologically and biochemically indistinguishable from GCSF produced by a bladder cell 
line has also been found. Colony-stimulating factors not only affects immune cells, but also induce 
cellular responses of non-immune cells, indicating possible involvement in tumor development 
upon abnormal expression. In addition several other genes of the 17q21 ARCHEON are involved 
in proliferation, survival, differentiation of immune cells and/or lymphoblastic leukemia, such as 
MLLT6, ZNF144 and ZNFN1A3, again demonstrating the related functions of the gene products 
in interconnected key processes within specific cell types. Aberrant expression of more than one of 
these genes in non-immune cells constitutes signalling activities, that contribute to the oncogen^ 
activities that derive solely from overexpression of the Her-2/neu gene. 

PPARBP has been found in complex with the tumorsuppressor gene of the p53 family. Moreover, 
PPARBP also binds to PPAR-alpha (PPARA), RAR-alpha (RARA), RXR, THRA and TR-beta-1. 
Due to it's ability to bind to thyroid hormone receptors it has been named TRIP2 and TRAP220. In 
this complexes PPARBP affects gene regulatory activities. Interestingly, PPARBP is located in 
genomic proximity to its interaction partners THRA and RARA. We have found PPARBP to be 
co-amplified with THRA and RARA in tumor tissue. THRA has been isolated from avian 
erythroblastosis virus in conjunction with ERBB and therefore was named ERBA. ERBA 
potentiates ERBB by blocking differentiation of erythroblasts at an immature stage. ERBA has 
been shown to influence ERBB expression. In this setting deletions of C-terminal portions of the 
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THRA gene product are of influence. Aberrant THRA expression has also been found 
nonfunctioning pituitary tumors, which has been hypothesized to reflect mutations in the recej 
coding and regulatory sequences. THRA function promotes tumor cell development by regulal 
gene expression of regulatory genes and by influencing metabolic activities (e.g. of key enzyi 
of alternative metabolic pathways in tumors such as malic enzyme and genes responsible 
lipogenesis). The observed activities of nuclear receptors not only reflect their transactivat 
potential, but are also due to posttranscriptional activities in the absence or presence of ligar 
Co-amplification of THRA /ERBA and ERBB has been shown, but its influence on tur 
development has been doubted as no overexpression could be demonstrated in breast tumors [ 
de Vijver et al., 1987, (75)]. THRA and RARA are part of nuclear receptor family whose funct 
can be mediated as monomers, homodimers or heterodimers. RARA regulates differentiation c 
broad spectrum of cells. Interactions of hormones with ERBB expression has been investigal 
Ligands of RARA can inhibit the expression of amplified ERBB genes in breast turn 
[Offterdinger et al., 1998, (76)]. As being part of this invention co-amplification and co-expressior 
THRA and RARA could be shown. It was also found that multiple genes, which are regulated 
members of the thyroid hormone receptor - and retinoic acid receptor family, are different^ 
expressed in tumor samples, corresponding to their genomic alterations (amplification, mutati 
deletion). These hormone receptor genes and respective target genes are useful to discrimin 
patient samples with respect to clinical features. 

By expression analysis of multiple normal tissues, tumor samples and tumor cell lines i 
subsequent clustering of the 17q21 region, it was found that the expression profile of Her-2/i 
positive tumor cells and tumor samples exhibits similarities with the expression pattern of tia 
from the central nervous system (Fig. 2). This is in line with the observed malformations in 
central nervous system of Her-2/neu and THRA knock-out mice. Moreover, it was found t 
NEUROD2, a nuclear factor involved specifically in neurogenesis, is commonly expressed in 
respective samples. This led to the definition of the 17q21 Locus as being an "ARCHEOl 
whose primary function in normal organ development is defined to the central nervous sysfc 
Surprisingly, the expression of NEUROD2 was affected by therapeutic intervention. Striking 
also ZNF144, TEM7, PIP5K and PPP1R1B are expressed in neuronal cells, where they disp 
diverse tissue specific functions. 

In addition Her-2/neu is often co-amplified with GRB7, a downstream member of the signal 
cascade being involved in invasive properties of tumors. Surprisingly, we have found anof 
member of the Her-2/neu signaling cascade being overexpressed in primary breast tumors TO 
(= 'Transducer of ERBB signaling"). Strong overexpression of TOB1 corellated with wea : 
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overexpression of Her-2/neu, already indicating its involvement in oncogenic signaling activities. 
Amplification of Her-2/neu has been assigned to enhanced proliferative capacity, due to the 
identified downstream components of the signaling cascade (e.g. Ras-Raf-MAPK). In this respect 
it was surprising that some cdc genes, which are cell cycle dependent kinases, are part of the 
5 amplicons, which upon altered expression have great impact on cell cycle progression. 

The ARCHEONS on 17q21 and 12ql3 are very closely related, as they do not only harbor 
isoforms of specific genes (e.g. CACNB1 vs. CACNB3, ERBB2 vs. ERBB3, RARA vs. RARG, 
see below), but are even flanked by whole gene clusters, consisting of multiple isoforms of one 
gene family positioned in tandem, such as the keratin and the HOX gene cluster. In this respect the 
10 simultaneous presence of keratins and receptors of the EGFR family, i.e. ERBB2 (Her-2/neu) and 
ERBB3 (Her-3) is of special interest, as the expression of individual keratins is very tightly 
controlled by the EGFR signalling pathway. ^ 

.Keratins are a group of water-insoluble proteins that form 10 nm intermediate filaments in 
epithelial cells. Approximately 30 different keratin molecules have been identified. They can be 

15 divided into acidic and basic-neutral subfamilies according to their relative charges, 

immunoreactivity, and sequence homologies to types I and II wool keratins, respectively. In vivo, 
a basic keratin usually is co-expressed and 'paired 1 with a particular acidic keratin to form a 
heterodimer. The expression of various keratin pairs is tissue specific, differentiation dependent, 
and developmentally regulated. The presence of specific keratin pairs is essential for the 

20 maintenance of the integrity of epithelium. Alterations of keratin expression have been observed in 
tumor epithelium, with an abnormal keratin pattern being expressed in tumor cells compared to the 
adjacent normal tissue. Mutations in human K14/K5 pair and the K10/K1 pair underlie skin 
diseases such as epidermolysis bullosa simplex and epidermolytic hyperkeratosis. The express^p 
of these and other keratins within the skin is tightly regulated. For example, the expression of 

25 K14/K5 pair is restricted to the basai ceii iaycr of the skin displaying no overlap with the K10/K1 
pair, which is solely expressed in the suprabasal layer. Gene expression is very tightly controlled 
by an interplay of multiple signalling cascades such as the EGFR, TGFR, sonic hedgehog and wnt- 
signaling, involving receptor tyrosine kinases and serin threonin kinases. In addition, 
posttranslational modifications such as serine/threonine and/or tyrosine phosphorylation events 

30 affect keratin function, and can be attributed to receptor tyrosine kinase signalling and MAPK and 
ERK activity. Posttranlational modifications of keratins not only alters the solubility of keratins, 
but also affects nuclear and signalling functions (e.g. after association with 14-3-3 protein). In 
addition, we did observe genomic alteration of the keratin gene clusters perturbing keratin 
expression pattern. 
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Moreover, the physical interaction of keratins, which are located in ARCHEONs of differen 
chromosomes and whose cell type specific expression at distinct differentiation status is regulatec 
by members of the same ARCHEONs is a superb example of the genetic interaction oJ 
ARCHEON genes. Examples of this tight interaction between the 12ql3 and 17q21 ARCHEONs 
are the expression and physical interaction of keratin 5 (basic keratin Type II located on 12ql3] 
and keratin 14 (acidic keratin Type I located on 17q21) in the basal layer of the skin, which is shut 
off in the suprabasal layer and compensated by the expression and physical interaction of keratin I 
(basic keratin Type II located on 12ql3) and keratin 10 (acidic keratin Type I located on 17q21). 
Diverse control mechanisms confer this exclusive expression control including chromosomal 
positioning and growth factor signaling activities. Interestingly, critical keratins are 
chromosomally postioned in an ordered fashion reflecting their related but exclusive function in 
different keratin pairs and in specific tissues, resembling the structure and function relationship of 
the hox gene clusters on the same chromosomes. Moreover, keratins whose mutation result in 
specific skin disorders (e.g. mutation of K5 and K14 results in hand and foot syndrom) are located 
at similiar positions within the ARCHEONs on chromosome 17q21 and 12ql3. The genes are in 
close proximity to genes involved in signaling events (e.g. ERBBs and RARs) regulating 
proliferation, differentiation and apoptotic events also in the skin tissue. For example Her-2/neu is 
specifically expressed within the basal layer of the skin, where assymmetric cell divisions of adult 
stem cells or ealry progenitor cells thereof give rise to a non-differentiated daughter cell residing 
in the basal layer and a differentiating daughter cell which is subsequently moving to the 
suprabasal compartment. These assymetric cell divisions guarantee the self-renewal arid the 
cellular homeostasis of the skin tissue. This is of importance for the biological functions of the 
skin such as barrier function towards the environmental stress including infectious agents. 
Perturbation of the signalling activities within the skin results in diseases similiar to the hereditary 
disorders reflecting mutations of specific keratin genes. In clinical studies it has been shown, that 
blocking EGFR signalling by antibody-treatment (e.g. cetuximab) and small molecule inhibitors 
(e.g. Iressa) targeted to the receptor tyrosin kinases can result in skin diseases (e.g. acne-like rash) 
of grade I, II or III. It is part of this invention, that these skin diseases not only reflect side effects 
of the respective treatments, but are an example for systemic changes occuring as a consequence 
of therapeutic regimen, thereby indicating suscebility of the endogenous signaling network to the 
therapeutic agents. This observation can have consequences on therapeutic decisions, as the 
therapeutic regimen are normally stopped or is reduced upon occurence of side effects. However, 
as the side effects (e.g. the skin dieseases occurring under anti growth factor treatment) are 
indicative of response to treatment (e.g. tumor shrinkage), the treatment should be endured even 
though "adverse" drug responses occur and side effects should be treated separately by agents 



BHC03 1 051 



-48- 

softening the symptoms. Skin diseases such as rash and hand and foot syndrom are just examples 
for a given side effect under a given treatment (i.e. anti tumor therapy), that can be used for 
response correlation. 

Similiarly to blocking receptor molecules itself, blocking downstream members of these signaling 
cascades results mainly in skin diseases (e.g. hand-and-foot syndroms). Surprisingly, we did 
observe, that treating tumor cells with agents blocking the EGFR/Her-2/neu signaling (e.g. 
Cetuximab, Iressa, Herceptin, RAF kinase inhibitor, etc.) shifts the expression of specific keratins 
being part of the ARCHEONS described in this invention. Moreover, the altered expression of 
keratins in tumor cells of patients is paralleled by a shift of keratin expression in the keratinocytes 
of the skin of the very same patient. Perturbation of keratin expression and or post-transcriptional 
modification in the skin tissue seems to resemble the suscebility of the endogenous growth factor 
signaling pathways to the respective treatment. The resulting skin diseases are therefore at least ^ 
some extent indicative of the tumor responsiveness to the regimen. This endogenous 
responsiveness to anti growth factor signaling agents can also be delineated from polymorphisms 
and genetic alterations (e.g. mutations) being present within the ARCHEON described in this 
invention. Of particular interest are in this context polymorphisms being present in the keratin 
genes. However, polymorphisms within keratins, keratin related genes and/or genes functionally 
connected to the keratin-based cytoskeleton, not necessarily being present within the ARCHEONs 
described, are also of importance according to their physical interaction with the respective gene 
products (e.g. ITGB4). It is part of this invention, that the responsiveness of a given tumor to anti 
growth factor treatment relates to the genetic predisposition of the respective signaling pathway 
members and target genes, which include keratins and related genes, that are markers for 
proliferation, differentiation and apoptosis in normal tissues, such as skin tissue. This knowledge 
can be used to predict the responsiveness of a tumor based on the characterization of surro^^ 
tissues, such as skin, blood and any other normal tissue containing the above mentioned genes 
and/or gene products. For example the responsiveness to Iressa, RAF-kinase inhibitor and 
antibody based therapies targeting EGFR and Her-2/neu can be delineated from punch biopsies of 
the skin (preferably by comparison of pre- and/or post-treatment samples) or blood samples by 
determining the expression pattern .or genetic characterization of keratin or keratin-related genes of 
an individual patient. Moreover, the responsiveness of such surrogate tissues can then be 
correlated to the tumor phenotype and the responsiveness of a tumor to the respective treatment, 
thereby predict therapy outcome. The examples of surrogate tissues are given by way of 
illustration and not by limitation. 
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It is yet another embodiment of the invention, that adverse drug responses such as heart toxicities 
can also be deduced from characteristica of the ARCHEONs described. Of particular interest are i 
the ARCEONs at 17ql2-24, 12ql3 and 3q21-26. It is known that anthracyclin based, anti-cancer 
regimens result in heart toxicities (such as dilated cardiomyopathies), as can be deduced e.g. by 
LVEF measurements. Moreover, anthracyclin pretreated patients have significantly increased heart 
toxicity events upon subsequent Herceptin™ based regimen. Interestingly, the ARCHEONs 
described in this invention not only harbor the primary targets of these therapies (i.e. 
topoisomerases and Her-2/neu), but also important structural and functional genes (Telethonin, 
PNMT, CACNB1, PPARBP, Her-2/neu, Her4) for muscle function including heart muscle 
function. These genes are involved in central processes of heart muscle function, such as tyrosine 
phosphorylation, serine/threonine phosphorylation, calcium influx, regulating e.g. central 
structural proteins such as titin. Moreover, these genes can be colocalized in heart muscles, 
dispaying their functional interplay in this tissue. In mouse models, the mislocalization of 
telethonin and the genetic inactivation of Her-2/neu, Her4 and Neuregulin result in a similiar 
phenotype as can be seen for cancer patients being treated with diverse anti-cancer drugs. The 
synergistic adverse drug response effect seen for the combinatorial treatment with anthracyclin 
and Herceptin™. Delineation of polymorphisms and haplotypes of the respective genes,: genomic 
region and/or the ARCHEON structure are indicative of the susceptibility to suffer from heart 
toxicities upon anti-cancer drug treatment. This is important for therapy decisions and cancer 
treatment management, as the prior therapies conducted exclude subsequent treatment options. For 
example, anthracyclin-based pretreatment can exclude subsequent Herceptin™ treatment or lead 
to reduced dosages, if possible heart toxicities (e.g. dilated cardiomyopathies) cannot be excluded. 

According to the observations described above the following examples of genes at 3q21-26 are 
offered by way of illustration, not by way of limitation. 

WNT5A, CACNA1D, THRB, RARB, TOP2B, RAB5B, SMARCC1 (BAF155), RAF, 
WNT7A 

The following examples of genes at 12ql3 are offered by way of illustration, not by way of 
limitation. 

CACNB3, Keratins, ERBB3, NR4A1, RAB5/13, RARG, STAT6, WNT10B, (GCN5), (SAS: 
Sarcoma Amplified Sequence), SMARCC2 (BAF170), SMARCD1 (BAF60A), (GAS41: 
Glioma Amplified Sequence), (CHOP), Her3, KRTHB, HOX C , IGFBP6, WNT5B 
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There is cross-talk between the amplified ARCHEONs described above and some other highly 
amplified genomic regions locate approximately at lpl3, lq32, 2pl6, 2q21, 3pl2, 5pl3, 6pl2, 
7pl2, 7q21, 8q23, llql3, 13ql2, 19ql3, 20ql3 and 21qll. The above mentioned chromosomal 
regions are described by way of illustration not by way of limitation, as the amplified regions often 
5 span larger and/or overlapping positions at these chromosomal positions. 

Additional alterations of non-transcribed genes, pseudogenes or intergenic regions of said 
chromosomal locations can be measured for prediction, diagnosis, prognosis, prevention and 
. . treatment of malignant neoplasia and breast cancer in particular. Some of the genes or genomic 
regions have no direct influence on the members of the ARCHEONs or the genes within distinct 
10 chromosomal regions but still retain marker gene function due to their chromosomal positioning in 
the neighborhood of functionally critical genes (e.g. Telethonin neighboring the Her-2/neu gene). 

Clinical Relevance of the genes which are part of the 17q21 Archeon for Response to 
Hercentin treatment 

Clinical Samples of patients being treated with Herceptin, docetaxel, paclitaxel, taxotere, 
15 carboplatin, cisplatin, oxaliplatin, vinorelbine, tamoxifen, anastrozole, letrozole, tamoxifen, 
epirubicin, doxorubicin and CMF were obtained. Primary tumor tissues and lymphnode tissues 
were obtained from neoadjuvant and adjuvant settings. In addition, biopsy material of first and 
second line therapies was obtained in some cases from metastatic lesions. These samples included 
formalin-fixed and paraffin-embedded material or fresh tissue from primary tumours and 
20 metastatic lesions of the respective patients. Moreover, whole blood, serum and plasma samples 
were included in the analysis. 

Multiparametric, clinical assessment of the response to Herceptin in combination w^) 
chemotherapeutics (e.g. docetaxel, taxotere, paclitaxel, vinorelbine, carboplatin, cisplatin), or 
other therapies described below, was performed, based on clinical information, such as histo- 

25 logical parameters (TNM-Stage, AJCC grade), standard molecular markers (IHC staining for 

estrogen receptor, progesteron receptor, Her-2/neu) and sonographical or radiological assessment 
(e.g. CT). In addition to combinatorial treatment, samples from single agent therapies were 
evaluated. Response to treatment was evaluated according to international standards. The 
ARCHEON genes were analyzed on DNA, RNA or protein level. Normalization of the 

30 ARCHEON genes was done by intra- or extrachromosomal reference genes (see EXAMPLE 3 
below) or by housekeeping genes of diverse expression level. 
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We could delineate specific regions of the ARCHEON to be informative for the response tc 
Herceptin-based therapy. As depicted below, genes that are located towards the centromer 
telomer of an individual chromosome in relation to a centrally localized gene within 
ARCHEON (e.g. Her-2/neu in the 17q21 ARCHEON) are in the following named to 
..centromeric" and ..telomeric", respectively. Of particular interest for response to Herceptin-bas 
treatment are genes being centromeric from the Her-2/neu gene locus on 17q21. The integrity 
this centromeric ARCHEON region is of importance for the phenotype of Her-2/neu positr 
tumors. Genetic alteration in the chromosomal region of PIP5K2B, FU20291, MLN50 TEM 
CACNB1, RPL19, MGC15482, PPARBP, CrkRS are critical for clinical outcome of Her-2/ne 
positive breast tumors. Of particular interest is the centromeric breakpoint region of the 17q2 
ARCHEON nearby the genes TEM7, CACNB1, CrkRS and PPARBP. Her-2/neu positive tumo, 
bearing elevated gene copy numbers of TEM7, CACNB1, CrkRS and PPARBP compared to oth< 
Her-2/neu positive tumors and/or normal tissue controls do have a worsened clinical outcome an 
a poor response to Herceptin based treatment. The genes within this region are involved in calciur 
and inositol signalling, which is fundamental with regard to cell survival mechanisms (e g 
CACNB1, PPP1R1B and PIP5K2B). Overexpression of CrkRS is of importance for the tumo 
phenotype, as its kinase activity regultaes the RNA polymerase U holoenzyme complex 
Especially the phosphorylation of the C-terminal domain and its associated components not onh 
has influence on the general activity of the enzyme complex, but also affects gene products, whosi 
importance for tumor cell growth has been demonstrated and some of which are part of the 
ARCHEONs (e.g. the SWI/SNF components SMARCs, e.g. SMARCC2, are critical for RB 
mediated tumor suppression). Phosphorylation of SMARCs is tightly regulated during cell cycle 
progression and affects the biological function of the SMARCs (influence on activity, stability 
and cellular localization). Altered phosphorylation of the RNA polymerase holoenzyme complex 
by CrkRS therefore most probably affects cell cycle progression. Moreover, the abnormal 
expression of TEM7, which we found to be elevated in a subclass of Her-2/neu positive tumors, 
whereas it was originally identified to be a tumor endothelial marker (TEM; see above), points' 
towards an intense interplay between tumor and endothelial cells resulting in a more aggressive 
behaviour of the respective tumor cells during metastasic processes such as intra- and 
extravasation. Strikingly, the genes within this region, i.e. ZNF144, TEM7, PIP5K, PPP1R1B and 
CACNB1, all do have physiological functions within the central nervous system. Therefore, we do 
assume, that a ..neuronal environment" would be favourable for tumor cells overexpressing these 
genes resulting in growth and survival advantages for these particular tumor cells. In accordance 
with this, it is observed that Herceptin resistant metastasis frequently occur in the brain. So far it 
has been discussed, that this observation refers to topological problems such as drug- 
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biovailability with respect to the blood brain barrier. It is part of this invention, that genes which 
are normally expressed within neuronal cells are integral part of the centromeric gene cluster of 
the ARCHEON on chromosome 17q21 and are involved in de novo and acquired resistance to 
Herceptin based treatment. Independent amplification units and/or deletion of singular genes of 
this centromeric cluster due to chromosomal breakage interferes with the survival and resistance 
function of this genomic region. Therefore the continuity of amplification units is another 
important feature with regard to responsivenessor unresponsiveness to therapy. It is noteworthy to 
mention, that not only the presence of particular genes, but also the presence of regulatory 
elements within this genomic region contribute to the above mentioned biological phenotype. 
Therefore also the loss or gain of regulatory elements within the centromeric part of the 
ARCHEON is of importance for resistance to anti cancer treatment and therefore part of this 
invention. ^) 

In addition to the alteration of centromeric ARCHEON region, the total length of the ARCHEON 
with regard to the telomeric region and the relative gene copy numbers of the amplified genes are 
of importance. Particularly the integrity of the genomic region harboring the TOP2alpha gene with 
the surrounding genes THRA, NR1D1, MLN51, WIRE, HsCDC6, RARA, CTEN, IGFBP4, EBI1 
and SMARCE1 is of interest. Her-2/neu positive tumors, that are deleted in at least some of this 
genes exhibit a worsened response to herceptin-based chemotherapy. This demonstrates, that this 
region is not only of prognostic value for anthracyclin-based therapy, but also of prognostic value 
for chemotherapeutic treatment with taxol-related agents and platin salts. The amplification, 
deletion or silencing of this telomeric region is accompanied with altered sensitivity to the above 
mentioned chemotherapeutics. This is a general feature of tumors bearing alterations (with regard 
to gene expression and/or amplification of the 17q21 ARCHEON) and not only true for brejt 
cancer. Li line with this, we have analyzed ovarian tumors bearing alterations in the 17qW 
ARCHEON and correlated the clinical outcome, that was assessed similarly as depicted above, 
with regard to a platin salt based therapy. Strikingly, tumors with defined genetic patterns within 
this telomeric regions did develop resistance to this chemotherapeutic regimen. Detecting solely 
the coamplification of Her-2/neu and TOP2alpha was not as informative with regard to response 
prediction as a detailed characterization and subsequent response correlation with the region of the 
THRA, NR1D1, MLN51, WIRE, HsCDC6 and RARA genes. It is part of this invention, that the 
proliferation status of tumors is affected by genes within ARCHEON regions. The 17q2l 
ARCHEON determines to at least some extent the proliferation rate of tumor cells. Interestingly, 
Her-2/neu positive tumors bearing elevated levels of a more limited number of genes, excluding 
several genes in the telomeric region (i.e. TOP2alpha, HsCDC6) exhibit a relatively slow growth 
rate, which diminishes the effect of chemotherapeutic drugs targeting proliferating cells and is one 
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of the reasons for the resistance of these tumors to said agents. Instead, these tumors have a highei 
capacity with regard to invasiveness and do have a diminished apoptotic rate, which to some 
extent refers to the signaling of Her-2/neu via GRB7 and AKT kinase (also affected by inositols 
and calcium, see above), respectively. Several genes within the telomeric region of the 
ARCHEONs affect Her-2/neu signalling, such as RARA, THRA, IGFBP4, and alter the respective 
characteristics of the cells including proliferation status. 

The ARCHEONs being part of this invention, are not only important for clinical response oi 
tumors to antibody-based therapies raised against EGFR- and Her-2/neu signaling (e.g. Herceptin, 
2C4 or cetuximab regimen) and to chemotherapeutic agents, but also are of importance for diverse 
strategies of anti hormonal treatment (e.g. Tamoxifen, Raloxifen, anastrozol, letrozol, faslodex). In 
particular, elevated levels of the PPARBP gene and protein and the integrity of the telomeric 
hormone receptor region of the 17q21 ARCHEON, bearing THRA, NR1D1 and RARA, or its 
related regions on the other ARCHEONs are of importance for these therapeutic regimens. In a 
retrospective, clinical study evaluating the above mentioned clinical parameters for adjuvant 
treatment of breast cancer with tamoxifen, we did observe, that the overexpression of PPARBP 
has impact on the overall survival of patients receiving this therapy. Overexpression of PPARBP 
enables activity of estrogen and progesteron receptors irrespective of a bound ligand. Therefore, 
the deregulation of the PPARBP results in the activity of these hormone receptors in the absence 
of the hormones and even in the presence of anti-hormones and thereby circumvents the anti 
tumor effect of anti hormonal strategies resulting in resistance of PPARBP overexpressing cells. In 
addition overexpression of hormone receptors other than estrogen receptor in tumor cells affects 
activity of estrogen or the respective anti-hormones by competition for dimerization partners, such 
as RXR, or transcriptional activator or repressor genes, such as CBP or NCOR. With regard to 
tamoxifen treatment this clearly diminishes the effect of the anti-hormone, as the pool of the 
transcriptional cofactors is reduced for the classical mode of action of tamoxifen within the 
nucleus.. 

The invention further relates to the use of: 

A) a polynucleotide comprising at least one of the sequences of SEQ ID NO: 1 to 26 or 53 to 
75; 

B) a polynucleotide which hybridizes under stringent conditions to a polynucleotide specified 
in (a) encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 
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C) a polynucleotide the sequence of which deviates from the polynucleotide specified in (a) 
and (b) due to the generation of the genetic code encoding a polypeptide exhibiting the 
same biological function as specified for the respective sequence in Table 2 or 3 

D) a polynucleotide which represents a specific fragment, derivative or allelic variation of a 
polynucleotide sequence specified in (a) to (c) 

E) an antisense molecule targeting specifically one of the polynucleotide sequences specified 
in (a) to (d); 

F) a purified polypeptide encoded by a polynucleotide sequence specified in (a) to (d) 

G) a purified polypeptide comprising at least one of the sequences of SEQ ID NO: 27 to 52 or 




H) an antibody capable of binding to one of the polynucleotide specified in (a) to (d) or a 
polypeptide specified in (f) and (g) 

I) a reagent identified by any of the methods of claim 14 to 16 that modulates the amount or 
activity of a polynucleotide sequence specified in (a) to (d) or a polypeptide specified in 
(f) and (g) 

In the preparation of a composition for the prevention, prediction, diagnosis, prognosis or a 
medicament for the treatment of malignant neoplasia and breast cancer in particular. 

Polynucleotides 

A „BREAST CANCER GENE" polynucleotide can be single- or double-stranded and comprise^ 
coding sequence or the complement of a coding sequence for a „BREAST CANCER GENE" 
polypeptide. Degenerate nucleotide sequences encoding human ,,BKJBAST CaNCER GENE" 
polypeptides, as well as homologous nucleotide sequences which are at least about 50, 55, 60, 65, 
70, preferably about 75, 90, 96, or 98% identical to the nucleotide sequences of SEQ ID NO: 1 to 
26or 53 to 75 also are ,JBREAST CANCER GENE" polynucleotides. Percent sequence identity 
between the sequences of two polynucleotides is determined using computer programs such as 
ALIGN which employ the FASTA algorithm, using an affine gap search with a gap open penalty 
of -12 and a gap extension penalty of -2. Complementary DNA (cDNA) molecules, species 
homologues, and variants of „BREAST CANCER GENE" polynucleotides which encode 
biologically active ,3REAST CANCER GENE" polypeptides also are „BREAST CANCER 
GENE" polynucleotides. 
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Preparation of Polynucleotides 



A naturally occurring BREAST CANCER GENE" polynucleotide can be isolated free of oth 
cellular components such as membrane components, proteins, and lipids. Polynucleotides can 1 
made by a cell and isolated using standard nucleic acid purification techniques, or synthesis 
using an amplification technique, such as the polymerase chain reaction (PCR), or by using z 
automatic synthesizer. Methods for isolating polynucleotides are routine and are known in the ai 
Any such technique for obtaining a polynucleotide can be used to obtain isolated J3REAS 
CANCER GENE" polynucleotides. For example, restriction enzymes and probes can be used t 
isolate polynucleotide fragments which comprises .3REAST CANCER GENE" nucleotid 
sequences. Isolated polynucleotides are in preparations which are free or at least 70, 80, or 90? 
free of other molecules. 

.BREAST CANCER GENE" cDNA molecules can be made with standard molecular biolog 
techniques, using ..BREAST CANCER GENE" mRNA as a template. Any RNA isolatio, 
technique which does not select against the isolation of mRNA may be utilized for the purificatio. 
of such RNA samples. See, for example, Sambrook et al., 1989, (77); and Ausubel, F. M. et al. 
1989, (78), both of which are incorporated herein by reference in their entirety. Additionally, larg< 
numbers of tissue samples may readily be processed using techniques well known to those of skil 
in the art, such as, for example, the single-step RNA isolation process of Chomczynski, P. (1989. 
U.S. Pat. No. 4,843,155), which is incorporated herein by reference in its entirety. 

■BREAST CANCER GENE" cDNA molecules can thereafter be replicated using molecular 
biology techniques known in the art and disclosed in manuals such as Sambrook et al., 1989, (77) . 
An amplification technique, such as PCR can be used to obtain additional copies of 
polynucleotides of the invention, using either human genomic DNA or cDNA as a template. 

Alternatively, synthetic chemistry techniques can be used to synthesizes .3REAST CANCER 
GENE" polynucleotides. The degeneracy of the genetic code allows alternate nucleotide sequences 
to be synthesized which will encode a BREAST CANCER GENE" polypeptide or a biologically 
active variant thereof. 

Identification of diffe rential expression 

Transcripts within the collected RNA samples which represent RNA produced by differentially 
expressed genes may be identified by utilizing a variety of methods which are ell known to those 
of skill in the art. For example, differential screening [Tedder, T. F. et al., 1988, (79)], subtractive 
hybridization [Hedrick, S. M. et al., 1984, (80); Lee, S . W. et al., 1984, (81)], and, preferably, 
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differential display (Liang, P., and Pardee, A. B., 1993, U.S. Pat. No. 5,262,311, which is 
incorporated herein by reference in its entirety), may be utilized to identify polynucleotide 
sequences derived from genes that are differentially expressed. 

Differential screening involves the duplicate screening of a cDNA library in which one copy of the 
library is screened with a total cell cDNA probe corresponding to the mRNA population of one 
cell type while a duplicate copy of the cDNA library is screened with a total cDNA probe 
corresponding to the mRNA population of a second cell type. For example, one cDNA probe may 
correspond to a total cell cDNA probe of a cell type derived from a control subject, while the 
second cDNA probe may correspond to a total cell cDNA probe of the same cell type derived from 
an experimental subject. Those clones which hybridize to one probe but not to the other potentially 
represent clones derived from genes differentially expressed in the cell type of interest in control 
versus experimental subjects. 

.Sub-tractive hybridization techniques generally involve the isolation of mRNA taken from two 
different sources, e.g., control and experimental tissue, the hybridization of the mRNA or single- 
stranded cDNA reverse-transcribed from the isolated mRNA, and the removal of all hybridized, 
and therefore double-stranded, sequences. The remaining non-hybridized, single-stranded cDNAs, 
potentially represent clones derived from genes that are differentially expressed in the two mRNA 
sources. Such single-stranded cDNAs are then used as the starting material for the construction of 
a library comprising clones derived from differentially expressed genes. 

The differential display technique describes a procedure, utilizing the well known polymerase 
chain reaction (PCR; the experimental embodiment set forth in Mullis, K. B., 1987, U.S. Pat. No. 
4,683,202) which allows for the identification of sequences derived from genes which 
differentially expressed. First, isolated RNA is reverse-transcribed into single-stranded cDl 
utilizing standard techniques which are well known to those of skill in the art. Primers for the 
reverse transcriptase reaction may include, but are not limited to, oligo dT-containing primers, 
preferably of the reverse primer type of oligonucleotide described below. Next, this technique uses 
pairs of PCR primers, as described below, which allow for the amplification of clones representing 
a random subset of the RNA transcripts present within any given cell. Utilizing different pairs of 
primers allows each of the mRNA transcripts present in a cell to be amplified. Among such 
amplified transcripts may be identified those which have been produced from differentially 
expressed genes. 

The reverse oligonucleotide primer of the primer pairs may contain an oligo dT stretch of 
nucleotides, preferably eleven nucleotides long, at its 5* end, which hybridizes to the poly(A) tail 
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of mRNA or to the complement of a cDNA reverse transcribed from an mRNA poly(A) tai 
Second, in order to increase the specificity of the reverse primer, the primer may contain one c 
more, preferably two, additional nucleotides at its 3' end. Because, statistically, only a subset o 
the mRNA derived sequences present in the sample of interest will hybridize to such primers, th 
additional nucleotides allow the primers to amplify only a subset of the mRNA derived sequence 
present in the sample of interest. This is preferred in that it allows more accurate and complet 
visualization and characterization of each of the bands representing amplified sequences. 

The forward primer may contain a nucleotide sequence expected, statistically, to have the abilitj 
to hybridize to cDNA sequences derived from the tissues of interest. The nucleotide sequence maj 
be an arbitrary one, and the length of the forward oligonucleotide primer may range from about S 
to about 13 nucleotides, with about 10 nucleotides being preferred. Arbitrary primer sequence 
cause the lengths of the amplified partial cDNAs produced to be variable, thus allowing different 
clones to be separated by using standard denaturing sequencing gel electrophoresis. PCR reaction 
conditions should be chosen which optimize amplified product yield and specificity, and, 
additionally, produce amplified products of lengths which may be resolved utilizing standard gel 
electrophoresis techniques. Such reaction conditions are well known to those of skill in the art, and 
important reaction parameters include, for example, length and nucleotide sequence of 
oligonucleotide primers as discussed above, and annealing and elongation step temperatures' and 
reaction times. The pattern of clones resulting from the reverse transcription and amplification of 
the mRNA of two different cell types is displayed via sequencing gel electrophoresis and 
compared. Differences in the two banding patterns indicate potentially differentially expressed 
genes. 

When screening for full-length cDNAs, it is preferable to use libraries that have been size-selected 
to include larger cDNAs. Randomly-primed libraries are preferable, in that they will contain more 
sequences which contain the 5' regions of genes. Use of a randomly primed library may be 
especially preferable for situations in which an oligo d(T) library does not yield a full-length 
cDNA. Genomic libraries can be useful for extension of sequence into 5' nontranscribed regulatory 
regions. 

Commercially available capillary electrophoresis systems can be used to analyze the size or 
confirm the nucleotide sequence of PCR or sequencing products. For example, capillary 
sequencing can employ flowable polymers for electrophoretic separation, four different 
fluorescent dyes (one for each nucleotide) which are laser activated, and detection of the emitted 
wavelengths by a charge coupled device camera. Output/light intensity can be converted to 
electrical signal using appropriate software (e.g. GENOTYPER and Sequence NAVIGATOR, 
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Perkin Elmer; ABI), and the entire process from loading of samples to computer analysis and 
electronic data display can be computer controlled. Capillary electrophoresis is especially 
preferable for the sequencing of small pieces of DNA which might be present in limited amounts 
in a particular sample. 

5 Once potentially differentially expressed gene sequences have been identified via bulk techniques 
such as, for example, those described above, the differential expression of such putatively 
differentially expressed genes should be corroborated. Corroboration may be accomplished via, for 
example, such well known techniques as Northern analysis and/or RT-PCR. Upon corroboration, 
the differentially expressed genes may be further characterized, and may be identified as target 

10 and/or marker genes, as discussed, below. 

Also, amplified sequences of differentially expressed genes obtained through, for exampl^ 
differential display may be used to isolate full length clones of the corresponding gene. The full 
length coding portion of the gene may readily be isolated, without undue experimentation, by 
molecular biological techniques well known in the art. For example, the isolated differentially 
15 expressed amplified fragment may be labeled and used to screen a cDNA library. Alternatively, 
the labeled fragment may be used to screen a genomic library. 

An analysis of the tissue distribution of the mRNA produced by the identified genes may be 
conducted, utilizing standard techniques well known to those of skill in the art. Such techniques 
may include, for example, Northern analyses and RT-PCR. Such analyses provide information as 
20 to whether the identified genes are expressed in tissues expected to contribute to breast cancer. 

Such analyses may also provide quantitative information regarding steady state mRNA regulation, 
yielding data concerning which of the identified genes exhibits a high level of regulation in, 
preferably, tissues which may be expected to contribute to breast cancer. ^ 

Such analyses may also be performed on an isolated cell population of a particular cell type 
25 derived from a given tissue. Additionally, standard in situ hybridization techniques may be utilized 
to provide information regarding which cells within a given tissue express the identified gene. 
Such analyses may provide information regarding the biological function of an identified gene 
relative to breast cancer in instances wherein only a subset of the cells within the tissue is thought 
to be relevant to breast cancer. 

30 Identification of co-amplified penes 

Genes involved in genomic alterations (amplifications, insertions, translocations, deletions, etc.) 
are identified by PCR-based karyotyping in combination with database analysis. Of particular 



BHC03 1 051 



-59- 

interest are gene amplifications, which account for gene copy numbers >2 per cell. Gene cop> 
number and gene expression of the respective genes often correlates. Therefore clusters of genes 
being simultaneously overexpressed due to gene amplifications can be identified by expression 
analysis via DNA-chip technologies or quantitative RTPCR. For example, the altered expression 
5 of genes due to increased or decreased gene copy numbers can be determined by GeneArray™ 
technologies from Asymetrix or qRT-PCR with the TaqMan or iCycler Systems. Moreover 
combination of RNA with DNA analytic enables highly parallel and automated characterization of 
multiple genomic regions of variable length with high resolution in tissue or single cell samples. 
Furthermore these assays enable the correlation of gene transcription relative to gene copy number 
10 of target genes. As there is not necessarily a linear correlation of expression level and gene copy 
number and as there are synergistic or antagonistic effects in certain gene clusters, the 
identification on the RNA-level is easier and probably more relevant for the biological outcome of 
the alterations especially in tumor tissue. 

Detection of co-amplified genes in malignant neoplasia 

Chromosomal changes are commonly detected by FISH (=Fluorescence-In-Situ-Hybridization) 
and CGH (=Comparativ6 Genomic Hybridization). For quantification of genomic regions genes or 
intergenic regions can be used. Such quantification measures the relative abundance of multiple 
genes with respect to each other (e.g. target gene vs. centromeric region or housekeeping genes). 
Changes in relative abundance can be detected in paraffin-embedded material even after extraction 
of RNA or genomic DNA. Measurement of genomic DNA has advantages compared to RNA- 
analysis due to the stability of DNA, which accounts for the possibility to perform also 
retrospective studies and offers multiple internal controls (genes not being altered, amplified or 
deleted) for standardization and exact calculations. Moreover, PCR-analysis of genomic DNA 
offers the advantage to investigate intergenic, highly variable regions or combinations of SNP's 
(=Single Nucleotide Polymorphisms), RFLPs, VNTRs and STRs (in general polypmorphic 
markers). Determination of SNPs or polypmorphic markers within defined genomic regions (e.g. 
SNP analysis by "Pyrosequencing™") has impact on the phenotype of the genomic alterations. For 
example it is of advantage to determine combinations of polymorphisms or haplotypes in order to 
characterize the biological potential of genes being part of amplified alleles. Of particular interest 
are polypmorphic markers in breakpoint regions, coding regions or regulatory regions of genes or 
intergenic regions. By determining predictive haplotypes wilh defined biological or clinical 
outcome it is possible to establish diagnostic and prognostic assays with non-tumor samples from 
patients. Depending on whether preferably one allele or both alleles to same extent are amplified 
(= linear or non-linear amplifications) haplotypes can be determined. Overrepresentation of 
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specific polypmorphic markers combinations in cells or tissues with gene amplifications facilitates 
haplotype determination, as e.g. combinations of heterozygous polypmorphic markers in nucleic 
acids isolated from normal tissues, body fluids or biological samples of one patient become almost 
homozygous in neoplastic tissue of the very same patient. This "gain of homozygosity" 
corresponds to the measurement of altered genomic region due to amplification events and is 
suitable for identification of "gain of function"- alterations in tumors, which result in e.g. 
oncogenic or growth promoting activities. In contrast, the detection of "losses of heterozygosity" 
is used for identification of anti-oncogenes, gate keeper genes or checkpoint genes, that suppress 
oncogenic activities and negatively regulate cellular growth processes. This intrinsic difference 
clearly opposes the impact of the respective genomic regions for tumor development and 
emphasizes the significance of "gain of homozygosity" measurements disclosed in this invention. 
In addition to the analyses on SNPs, a comparative approach of blood leucocyte DNA and tumg^ 



DNA based on VNTR detection can reveal the existance of a formerely described ARCHEON7 
SNP and VNTR sequences and primer sets most suitable for detection of theARCHEON at 17ql 1- 
21 are disclosed in Table 4 and Table 6. Detection, quantification and sizing of such polymorphic 
markers can be achieved by methods known to those with skill in the art. In one embodiment of 
this invention we disclose the comparative measurement of amount and size of any of the 
disclosed VNTRs (Table 6) by PCR amplification and capillary electrophoresis. PCR can be 
carried out by standart protocols favorably in a linear amplification range (low cycle number) and 
detection by CE should be carried out by suppliers protocols (e.g. Agilent). More favorably the 
detection of the VNTRs disclosed in Table 6 can be carried out in a multiplex fashion, utilizing a 
variety of labeled primers (e.g. fluoreszent, radioactive, bioactive) and a suitable CE detection 
system (e.g. ABI 310). However the detection can also be performed on slab gels consiting of 
highly concentrated agarose or polyacrylamide with a monochromal DNA stain. Enhancemen^J^ 
resolution can be achieved by appropriate primer design and length variation to give best results in 



It is also of interest to determine covalent modifications of DNA (e.g. methylation) or the 
associated chromatin (e.g. acetylation or methylation of associated proteins) within the altered 
genomic regions, that have impact on transcriptional activity of the genes. In general, by 
measuring multiple, short sequences (60-300 bp) these techniques enable high-resolution analysis 
of target regions, which cannot be obtained by conventional methods such as FISH analytic (2-100 
kb). Moreover the PCR-based DNA analysis techniques offer advantages with regard to 
sensitivity, specificity, multiplexing, time consumption and low amount of patient material 
required. These techniques can be optimized by combination with microdissection or 
macrodissection to obtain purer starting material for analysis. 
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Extendinp PnlynurlentMe* 

In one embodiment of such a procedure for the identification and cloning of full length ge 
sequences, RNA may be isolated, following standard procedures, from an appropriate tissue 
cellular source. A reverse transcription reaction may then be performed on the RNA using , 
oligonucleotide primer complimentary to the mRNA that corresponds to the amplified fragmex 
for the priming of first strand synthesis. Because the primer is anti-parallel to the mRN, 
extension will proceed toward the 5' end of the mRNA. The resulting RNA hybrid may then I 
"tailed" with guanines using a standard terminal transferase reaction, the hybrid may be digeste 
with RNase H, and second strand synthesis may then be primed with a poly-C primer. Using ft 
two primers, the 5' portion of the gene is amplified using PCR. Sequences obtained may then b 
isolated and recombined with previously isolated sequences to generate a full-length cDNA of th 
differentially expressed genes of the invention. For a review of cloning strategies and recombinar 
DNA techniques, see e.g., Sambrook et al., (77); and Ausubel et al., (78). 

Various PCR-based methods can be used to extend the polynucleotide sequences disclosed hereii 
to detect upstream sequences such as promoters and regulatory elements. For example, restrictioi 
site PCR uses universal primers to retrieve unknown sequence adjacent to a known locus [Sarkar 
1993, (82)]. Genomic DNA is first amplified in the presence of a primer to a linker sequence and S 
primer specific to the known region. The amplified sequences are then subjected to a second rounc 
of PCR with the same linker primer and another specific primer internal to the first onejproducte 
of each round of PCR are transcribed with an appropriate RNA polymerase and sequenced using 
reverse transcriptase. 

Inverse PCR also can be used to amplify or extend sequences using divergent primers based on a 
known region [Triglia et al., 1988 ,(83)]. Primers can be designed using commercially available 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences Inc., Plymouth, 
Minn.), to be e.g. 2230 nucleotides in length, to have a GC content of 50% or more, and to anneal 
to the target sequence at temperatures about 68-72°C. The method uses several restriction enzymes 
to generate a suitable fragment in the known region of a gene. The fragment is then circularized by 
intramolecular ligation and used as a PCR template. 

Another method which can be used is capture PCR, which involves PCR amplification of DNA 
fragments adjacent to a known sequence in human and yeast artificial chromosome DNA 
[Lagerstrom et al., 1991, (84)]. In this method, multiple restriction enzyme digestions and ligations 
also can be used to place an engineered double-stranded sequence into an unknown fragment of 
the DNA molecule before performing PCR. 
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Additionally, PCR, nested primers, and PROMOTERFINDER libraries (CLONTECH, Palo Alto, 
Calif.) can be used to walk genomic DNA (CLONTECH, Palo Alto, Calif.). This process avoids 
the need to screen libraries and is useful in finding intron/exon junctions. 

The sequences of the identified genes may be used, utilizing standard techniques, to place the 
genes onto genetic maps, e.g., mouse [Copeland & Jenkins, 1991, (85)] and human genetic maps 
[Cohen, et al., 1993 ,(86)]. Such mapping information may yield information regarding the genes' 
importance to human disease by, for example, identifying genes which map near genetic regions to 
which known genetic breast cancer tendencies map. 

Identification of polynucleotide variants and homoloeues or splice variants 

Variants and homologues of the .3REAST CANCER GENE" polynucleotides described above 
also are ,J3REAST CANCER GENE" polynucleotides. Typically, homologous „BREASW 
CANCER GENE" polynucleotide sequences can be identified by hybridization of candidate 
polynucleotides "to known" 'J3REAST CANCER GENE" polynucleotides under stringent 
conditions, as is known in the art. For example, using the following wash conditions: 2 X SSC 
(0.3 M NaCl, 0.03 M sodium citrate, pH7.0), 0.1% SDS, room temperature twice, 30 minutes 
each; then 2 X SSC, 0.1% SDS, 50 EC once, 30 minutes; then 2 X SSC, room temperature twice, 
10 minutes each homologous sequences can be identified which contain at most about 25-30% 
basepair mismatches. More preferably, homologous polynucleotide strands contain 15-25% 
basepair mismatches, even more preferably 5-15% basepair mismatches. 

Species homologues of the ..BREAST CANCER GENE" polynucleotides disclosed herein also can 
be identified by making suitable probes or primers and screening cDNA expression libraries from 
other species, such as mice, monkeys, or yeast. Human variants of ..BREAST CANCER GEN^ 
polynucleotides can be identified, for example, by screening human cDNA expression libraries. It 
is well known that the T m of a double-stranded DNA decreases by 1-1. 5°C with every 1% decrease 
in homology [Bonner et al., 1973, (87)]. Variants of human ..BREAST CANCER GENE" 
polynucleotides or ..BREAST CANCER GENE" polynucleotides of other species can therefore be 
identified by hybridizing a putative homologous ..BREAST CANCER GENE" polynucleotide with 
a polynucleotide having a nucleotide sequence of one of the sequences of the SEQ ID NO: 1 to 26 
or 53 to 75 or the complement thereof to form a test hybrid. The melting temperature of the test 
hybrid is compared with the melting temperature of a hybrid comprising polynucleotides having 
perfectly complementary nucleotide sequences, and the number or percent of basepair mismatches 
within the test hybrid is calculated. 
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Nucleotide sequences which hybridize to „BREAST CANCER GENE" polynucleotides or their 
complements following stringent hybridization and/or wash conditions also are „BREAST 
CANCER GENE" polynucleotides. Stringent wash conditions are well known and understood in 
the art and are disclosed, for example, in Sambrook et al., (77). Typically, for stringent 
hybridization conditions a combination of temperature and salt concentration should be chosen 
that is approximately 12-20°C below the calculated T m of the hybrid under study. The T m of a 
hybrid between a „BREAST CANCER GENE" polynucleotide having a nucleotide sequence of 
one of the sequences of the SEQ ID NO: 1 to 26 or 53 to 75 or the complement thereof and a 
polynucleotide sequence which is at least about 50, preferably about 75, 90, 96, or 98% identical 
to one of those nucleotide sequences can be calculated, for example, using the equation below 
[Bolton and McCarthy, 1962, (88): 

T m = 81.5°C - l^OogjolNa 4 ]) + 0.41(%G + C) - 0.63(%formamide) - 600/1), 

where 1 = the length of the hybrid in basepairs. 

Stringent wash conditions include, for example, 4 X SSC at 65°C, or 50% formamide, 4 X SSC at 
28°C, or 0.5 X SSC, 0.1% SDS at 65°C. Highly stringent wash conditions include, for example, 
0.2XSSCat65°C. 

The biological function of the identified genes may be more directly assessed by utilizing relevant 
in vivo and in vitro systems. In vivo systems may include, but are not limited to, animal systems 
which naturally exhibit breast cancer predisposition, or ones which have been engineered to 
exhibit such symptoms, including but not limited to the apoE-deficient malignant neoplasia mouse 
model [Plump et al., 1992, (89)]. 

Splice variants derived from the same genomic region, encoded by the same pre mRNA can be 
identified by hybridization conditions described above for homology search. The specific 
characteristics of variant proteins encoded by splice variants of the same pre transcript may differ 
and can also be assayed as disclosed. A ,3REAST CANCER GENE" polynucleotide having a 
nucleotide sequence of one of the sequences of the SEQ ID NO: 1 to 26 or 53 to 75 or the 
complement thereof may therefor differ in parts of the entire sequence as presented for SEQ ID 
NO: 60 and the encoded splice variants SEQ ID NO: 61 to 66. These refer to individual proteins 
SEQ ID NO: 83 to 89. The prediction of splicing events and the identification of the utilized 
acceptor and donor sites within the pre mRNA can be computed (e.g. Software Package GRAIL 
or GenomeSCAN) and verified by PCR method by those with skill in the art 
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Antisense oligonucleotides 

Antisense oligonucleotides are nucleotide sequences which are complementary to a specific DNA 
or RNA sequence. Once introduced into a cell, the complementary nucleotides combine with 
natural sequences produced by the cell to form complexes and block either transcription or 
translation. Preferably, an antisense oligonucleotide is at least 6 nucleotides in length, but can be at 
least 7, 8, 10, 12, 15, 20, 25, 30, 35, 40, 45, or 50 or more nucleotides long. Longer sequences also 
can be used. Antisense oligonucleotide molecules can be provided in a DNA construct and 
introduced into a cell as described above to decrease the level of ,3REAST CANCER GENE" 
gene products in the cell. 

Antisense oligonucleotides can be deoxyribonucleotides, ribonucleotides, peptide nucleic acids 
(PNAs; described in U.S. Pat. No. 5,714,331), locked nucleic acids (LNAs; described in 
99/12826), or a combination of them. Oligonucleotides can be synthesized manually or by an 
automated synthesizer, by covalently linking the 5 f end of one nucleotide with the 3' end of another 
nucleotide with non-phosphodiester internucieotide linkages such alkylphosphonates, 
phosphorothioates, phosphorodithioates, alkylphosphonothioates, alkylphosphonates, 
phosphoramidates, phosphate esters, carbamates, acetamidate, carboxymethyl esters, carbonates, 
and phosphate triesters [Brown, 1994, (126); Sonveaux, 1994, (127) and Uhlmann et al., 1990, 
(128)]. 

Modifications of JBREAST CANCER GENE" expression can be obtained by designing antisense 
oligonucleotides which will form duplexes to the control, 5', or regulatory regions of the 
,3REAST CANCER GENE". Oligonucleotides derived from the transcription initiation site, e.g., 
between positions 10 and +10 from the start site, are preferred. Similarly, inhibition can he 
achieved using "triple helix" base-pairing methodology. Triple helix pairing is useful becausS 
causes inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or chaperons. Therapeutic advances using triplex DNA have 
been described in the literature [Gee et al., 1994, (129)]. An antisense oligonucleotide also can be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 

Precise complementarity is not required for successful complex formation between an antisense 
oligonucleotide and the complementary sequence of a ,JBREAST CANCER GENE" 
polynucleotide. Antisense oligonucleotides which comprise, for example, 2, 3, 4, or 5 or more 
stretches of contiguous nucleotides which are precisely complementary to a „BREAST CANCER 
GENE" polynucleotide, each separated by a stretch of contiguous nucleotides which are not 
complementary to adjacent „BREAST CANCER GENE" nucleotides, can provide sufficient 
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targeting specificity for „BREAST CANCER GENE" mRNA. Preferably, each stretch 
complementary contiguous nucleotides is at least 4, 5, 6, 7, or 8 or more nucleotides in leng 
Non-complementary intervening sequences are preferably 1, 2, 3, or 4 nucleotides in length. C 
stalled in the art can easily use the calculated melting point of an antisense-sense pair to determi 
the degree of mismatching which will be tolerated between a particular antisense oligonucleoti 
and a particular „BREAST CANCER GENE" polynucleotide sequence. 

Antisense oligonucleotides can be modified without affecting their ability to hybridize to 
,3REAST CANCER GENE" polynucleotide. These modifications can be internal or at one . 
both ends of the antisense molecule. For example, intemucleoside phosphate linkages can t 
modtfed by adding cholesteryl or diamine moieties with varying numbers of carbon residu. 
between the amino groups and terminal ribose. Modified bases and/or sugars, such as arabino* 
instead of ribose, or a 3', 5' substituted oligonucleotide in which the 3' hydroxyl group or the 
Phosphate group are substituted, also can be employed in a modified antisense oligonucleotid. 
These modified oligonucleotides can be prepared by methods well known in the art[ Agrawal < 
al., 1992, (130); Uhlmann et al., 1987, (131) and Uhlmann et al., (128)]. 

Ribozymes 

Ribozymes are RNA molecules with catalytic activity [Cech, 1987, (132); Cech, 1990 (133) an, 
Couture & Stinchcomb, 1996, (134)]. Ribozymes can be used to inhibit gene function by>leavin< 
an RNA sequence, as is known in the art (e.g., Haseloff et al., U.S. Patent 5,641,673) Th, 
mechamsm of ribozyme action involves sequence-specific hybridization of the ribozyme molecule 
to complementary target RNA, followed by endonucleolytic cleavage. Examples include 
engmeered hammerhead motif ribozyme molecules that can specifically and efficiently catalyze 
endonucleolytic cleavage of specific nucleotide sequences. 

The transcribed sequence of a „BREAST CANCER GENE" can be used to generate ribozymes 
wtach will specifically bind to mRNA transcribed from a , 3 REAST CANCER GENE" genomic 
locus. Methods of designing and constructing ribozymes which can cleave other RNA molecules 
in trans in a highly sequence specific manner have been developed and described in the art 
[Haseloff et al., 1988, (135)]. For example, the cleavage activity of ribozymes can be targeted to 
specific RNAs by engineering a discrete "hybridization" region into the ribozyme. The 
hybridization region contains a sequence complementary to the target RNA and thus specifically 
hybridizes with the target [see, for example, Gerlach et al., EP 0 321201J. 
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Specific ribozyme cleavage sites within a ..BREAST CANCER GENE" RNA target can be 
identified by scanning the target molecule for ribozyme cleavage sites which include the following 
sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides corresponding to the region of the target RNA containing the cleavage site can be 
evaluated for secondary structural features which may render the target inoperable. Suitability of 
candidate ..BREAST CANCER GENE" RNA targets also can be evaluated by testing accessibility 
to hybridization with complementary oligonucleotides using ribonuclease protection assays. 
Longer complementary sequences can be used to increase the affinity of the hybridization 
sequence for the target The hybridizing and cleavage regions of the ribozyme can be integrally 
related such that upon hybridizing to the target RNA through the complementary regions, the 
catalytic region of the ribozyme can cleave the target. 

Ribozymes can be introduced into cells as part of a DNA construct. Mechanical methods, such # 
microinjection, liposome-mediated transfection, electroporation, or calcium phosphate 
precipitation, can be used to introduce a-ribozyme-containing DNA construct into cells in which it 
is desired to decrease ..BREAST CANCER GENE" expression. Alternatively, if it is desired that 
the cells stably retain the DNA construct, the construct can be supplied on a plasmid and 
maintained as a separate element or integrated into the genome of the cells, as is known in the art. 
A ribozyme-encoding DNA construct can include transcriptional regulatory elements, such as a 
promoter element, an enhancer or UAS element, and a transcriptional terminator signal, for 
controlling transcription of ribozymes in the cells. 

As taught in Haseloff et al., U.S Pat. No. 5,641,673, ribozymes can be engineered so that ribozyme 
expression will occur in response to factors which induce expression of a target gene. Ribozymes 
also can be engineered to provide an additional level of regulation, so that destruction of mRl^ 
occurs only when both a ribozyme and a target gene are induced in the cells. 
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Polypeptides 

"BREAST CANCER GENE" polypeptides according to the invention comprise an polypeptic 
selected from SEQ ID NO: 27 to 52 and 76 to 98 or encoded by any of the polynucleotid 
sequences of the SEQ ID NO: 1 to 26 and 53 to 75 or derivatives, fragments, analogues an 
homologues thereof. A "BREAST CANCER GENE" polypeptide of the invention therefore can b 
a portion, a full-length, or a fusion protein comprising all or a portion of a "BREAST CANCE1 
GENE" polypeptide. 

Protein Purification 

.3REAST CANCER GENE" polypeptides can be purified from any cell which expresses th, 
enzyme, including host cells which have been transfected with JBREAST CANCER GENE' 
expression constructs. Breast tissue is an especially useful source of „BREAST CANCER GENE' 
polypeptides. A purified „BREAST CANCER GENE" polypeptide is separated from othe. 
compounds which normally associate with the „BREAST CANCER GENE" polypeptide in the 
cell, such as certain proteins, carbohydrates, or lipids, using methods well-known in the art. Sue! 
methods include, but are not limited to, size exclusion chromatography, ammonium sulfate 
fractionation, ion exchange chromatography, affinity chromatography, and preparative gel 
electrophoresis. A preparation of purified ,3REAST CANCER GENE" polypeptides is at least 
80% pure; preferably, the preparations are 90%, 95%, or 99% pure. Purity of the preparations can 
be assessed by any means known in the art, such as SDS-polyacrylamide gel electrophoresis. 

Obtaining Polypeptides 

..BREAST CANCER GENE" polypeptides can be obtained, for example, by purification from 
human cells, by expression of .3REAST CANCER GENE" polynucleotides, or by direct chemical 
synthesis. 

Biologically Active Variant* 

,3REAST CANCER GENE" polypeptide variants which are biologically active, i.e., retain an 
.3REAST CANCER GENE" activity, also are ..BREAST CANCER GENE" polypeptides. 
Preferably, naturally or non-naturally occurring ..BREAST CANCER GENE" polypeptide variants 
have amino acid sequences which are at least about 60. 65, or 70, preferably about 75, 80, 85, 90, 
92, 94, 96, or 98% identical to the any of the amino acid sequences of the polypeptides of SEQ ID 
NO: 27 to 52 or 76 to 98 or the polypeptides encoded by any of the polynucleotides of SEQ ID 
NO: 1 to 26 or 53 to 75 or a fragment thereof. Percent identity between a putative .3REAST 
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CANCER GENE" polypeptide variant and of the polypeptides of SEQ ID NO: 27 to 52 or 76 to 98 
or the polypeptides encoded by any of the polynucleotides of SEQ ID NO: 1 to 26 or 53 to 75 or a 
fragment thereof is determined by conventional methods. [See, for example, Altschul et aL> 1986, 
(90 and Henikoff & Henikoff, 1992, (91)]. Briefly, two amino acid sequences are aligned to 
5 optimize the alignment scores using a gap opening penalty of 1 0, a gap extension penalty of 1 , and 
the "BLOSUM62" scoring matrix of Henikoff & Henikoff, (91). 

Those skilled in the art appreciate that there are many established algorithms available to align two 
amino acid sequences. The "FASTA" similarity search algorithm of Pearson & Lipman is a 
suitable protein alignment method for examining the level of identity shared by an amino acid 

10 sequence disclosed herein and the amino acid sequence of a putative variant [Pearson & Lipman, 
1988, (92), and Pearson, 1990, (93)]. Briefly, FASTA first characterizes sequence similarity by 
identifying regions shared by the query sequence (e.g., SEQ ID NO: 1 to 26 or 53 to 75) and a te^ 
sequence that have either the highest density of identities (if the ktup variable is 1) or pairs of 
identities (if ktup=2X without considering conservative amino acid substitutipns, insertions, or 

15 deletions. The ten regions with the highest density of identities are then rescored by comparing the 
similarity of all paired amino acids using an amino acid substitution matrix, and the ends of the 
regions are "trimmed" to include only those residues that contribute to the highest score. If there 
are several regions with scores greater than the "cutoff' value (calculated by a predetermined 
formula based upon the length of the sequence the ktup value), then the trimmed initial regions are 

20 examined to determine whether the regions can be joined to form an approximate alignment with 
gaps. Finally, the highest scoring regions of the two amino acid sequences are aligned using a 
modification of the Needleman-Wunsch-Sellers algorithm [Needleman & Wunsch, 1970, (94), and 
Sellers, 1974, (95)], which allows for amino acid insertions and deletions. Preferred parameters for 
FASTA analysis are: ktup=l, gap opening penalty=10, gap extension penalty=l, and substituti^ 

25 matrix=BLOSUM62. These parameters can be introduced into a FASTA program by modifying 
the scoring matrix file ("SMATRDC"), as explained in Appendix 2 of Pearson, (93). 

FASTA can also be used to determine the sequence identity of nucleic acid molecules using a ratio 
as disclosed above. For nucleotide sequence comparisons, the ktup value can range between one 
to six, preferably from three to six, most preferably three, with other parameters set as default. 

30 Variations in percent identity can be due, for example, to amino acid substitutions, insertions, or 
deletions. Amino acid substitutions are defined as one for one amino acid replacements. They are 
conservative in nature when the substituted amino acid has similar structural and/or chemical 
properties. Examples of conservative replacements are substitution of a leucine with an isoleucine 
or valine, an aspartate with a glutamate, or a threonine with a serine. 
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Amino acid insertions or deletions are changes to or within an amino acid sequence. Thej 
typically fall in the range of about 1 to 5 amino acids. Guidance in determining which amino acic 
residues can be substituted, inserted, or deleted without abolishing biological or immunologics 
activity of a „BKEAST CANCER GENE" polypeptide can be found using computer program} 
well known in the art, such as DNASTAR software. Whether an amino acid change results in z 
biologically active .3REAST CANCER GENE" polypeptide can readily be determined b> 
assaying for „BREAST CANCER GENE" activity, as described for example, in the specific 
Examples, below. Larger insertions or deletions can also be caused by alternative splicing. Protein 
domains can be inserted or deleted without altering the main activity of the protein. 

Fusion Proteins 

Fusion proteins are useful for generating antibodies against .3REAST CANCER GENE" 
polypeptide amino acid sequences and for use in various assay systems. For example, fusion 
proteins can be used to identify proteins which interact with portions of a „BREAST CANCER 
GENE" polypeptide. Protein affinity chromatography or library-based assays for protein-protein 
interactions, such as the yeast two-hybrid or phage display systems, can be used for this purpose. 
Such methods are well known in the art and also can be used as drug screens. 

A ..BREAST CANCER GENE" polypeptide fusion protein comprises two polypeptide segments 
fused together by means of a peptide bond. The first polypeptide segment comprises at least 25, 
50, 75, 100, 150, 200, 300, 400, 500. 600, 700 or 750 contiguous amino acids of an amino acid 
sequence encoded by any polynucleotide sequences of the SEQ ID NO: 1 to 26 or 53 to 75 or of a 
biologically active variant, such as those described above. The first polypeptide segment also can 
comprise full-length JBREAST CANCER GENE". 

The second polypeptide segment can be a full-length protein or a protein fragment. Proteins 
commonly used in fusion protein construction include P-galactosidase, P-glucuronidase, green 
fluorescent protein (GFP), autofluorescent proteins, including blue fluorescent protein (BFP), 
glutathione-S-transferase (GST), luciferase, horseradish peroxidase (HRP), and chloramphenicol 
acetyltransferase (CAT). Additionally, epitope tags are used in fusion protein constructions, 
including histidine (His) tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G 
tags, and thioredoxin (Trx) tags. Other fusion constructions can include maltose binding protein 
(MBP), S-tag, Lex a DNA binding domain (DBD) fusions, GAL4 DNA binding domain fusions, 
and herpes simplex virus (HSV) BP16 protein fusions. A fusion protein also can be engineered to 
contain a cleavage site located between the .3REAST CANCER GENE" polypeptide-encoding 
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sequence and the heterologous protein sequence, so that the „BREAST CANCER GENE" 
polypeptide can be cleaved and purified away from the heterologous moiety. 

A fusion protein can be synthesized chemically, as is known in the art. Preferably, a fusion protein 
is produced by covalently linking two polypeptide segments or by standard procedures in the art of 
molecular biology. Recombinant DNA methods can be used to prepare fusion proteins, for 
example, by making a DNA construct which comprises coding sequences selected from any of the 
polynucleotide sequences of the SEQ ID NO: 1 to 26 and 53 to 75 in proper reading frame with 
nucleotides encoding the second polypeptide segment and expressing the DNA construct iaa host 
cell, as is known in the art. Many kits for constructing fusion proteins are available from 
companies such as Promega Corporation (Madison, WI), Stratagene (La Jolla, CA), CLONTECH 
(Mountain View, CA), Santa Cruz Biotechnology (Santa Cruz, CA), MBL International 
Corporation (MIC; Watertown, MA), and Quantum Biotechnologies (Montreal, Canada; 1-8^1 
DNA-KITS). 

Identification of Species Homologues 

Species homologues of human a „BREAST CANCER GENE" polypeptide can be obtained using 
„BREAST CANCER GENE" polypeptide polynucleotides (described below) to make suitable 
probes or primers for screening cDNA expression libraries from other species, such as mice, 
monkeys, or yeast, identifying cDNAs which encode homologues of a „BREAST CANCER 
GENE" polypeptide, and expressing the cDNAs as is known in the art. 

Expression of Polynucleotides 

To express a ,3REAST CANCER GENE" polynucleotide, the polynucleotide can be inserted ^ 
an expression vector which contains the necessary elements for the transcription and translation of 
the inserted coding sequence. Methods which are well known to those skilled in the art can he 
used to construct expression vectors containing sequences encoding „BREAST CANCER GENE" 
polypeptides and appropriate transcriptional and translational control elements. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic 
recombination. Such techniques are described, for example, in Sambrook et al., (77) and in 
Ausubel et al., (78). 

A variety of expression vector/host systems can be utilized to contain and express sequences 
encoding a „BREAST CANCER GENE" polypeptide. These include, but are not limited to, 
microorganisms, such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid 
DNA expression vectors; yeast transformed with yeast expression vectors, insect cell systems 
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infected with virus expression vectors (e.g., baculovirus), plant cell systems transformed wi 
virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) 
with bacterial expression vectors (e.g., Ti or pBR322 plasmids), or animal cell systems. 

The control elements or regulatory sequences are those regions of the vector enhancers, promoter 
5' and 3' untranslated regions which interact with host cellular proteins to carry out transcriptic 
and translation. Such elements can vary in their strength and specificity. Depending on the vect. 
system and host utilized, any number of suitable transcription and translation elements, includin 
constitutive and inducible promoters, can be used. For example, when cloning in bacterial system; 
inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid (Stratagem 
LaJolla, Calif.) or pSPORTl plasmid (Life Technologies) and the like can be used. Th 
baculovirus polyhedrin promoter can be used in insect cells. Promoters or enhancers derived fror. 
the genomes of plant cells (e.g., heat shock, RUBISCO, and storage protein genes) or from plan 
viruses (e.g., viral promoters or leader sequences) can be cloned into the vector. In mammalia, 
cell systems, promoters from mammalian genes or from mammalian viruses are preferable. If it i; 
necessary to generate a cell line that contains multiple copies of a nucleotide sequence encoding i 
..BREAST CANCER GENE" polypeptide, vectors based on SV40 or EBV can be used with ar 
appropriate selectable marker. 



BHC 03 1 051 



-72- 

Bacterial and Yeast Expression Systems 

In bacterial systems, a number of expression vectors can be selected depending upon the use 
intended for the ..BREAST CANCER GENE" polypeptide. For example, when a large quantity of 
the „BREAST CANCER GENE" polypeptide is needed for the induction of antibodies, vectors 
which direct high level expression of fusion proteins that are readily purified can be used. Such 
vectors include, but are not limited to, multifunctional E. coli cloning and expression vectors such 
as BLUESCRIPT (Stratagene). In a BLUESCRIPT vector, a sequence encoding the „BREAST 
CANCER GENE" polypeptide can be ligated into the vector in frame with sequences for the 
amino terminal Met and the subsequent 7 residues of B-galactosidase so that a hybrid protein is 
produced. pIN vectors [Van Heeke & Schuster, (17)] or pGEX vectors (Promega, Madison, Wis.) 
also can be used to express foreign polypeptides as fusion proteins with glutathione S-transferase 
(GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells ^ 
adsorption to glutathione agarose beads followed by elution in the presence of free glutathione. 
Proteins made in such systems can-be designed to include heparin, thrombin, or factor Xa protease 
cleavage sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast Saccharomyces cerevisiae, a number of vectors containing constitutive or inducible 
promoters such as alpha factor, alcohol oxidase, and PGH can be used. For reviews, see Ausubel et 
al., (4)andGrantetal.,(18). 

Plant and Insect Exp ression Systems 

If plant expression vectors are used, the expression of sequences encoding „BREAST CANCER 
GENE" polypeptides can be driven by any of a number of promoters. For example, viral promo^ 
such as the 35S and 19S promoters of CaMV can be used alone or in combination with the omega 
leader sequence from TMV [Takamatsu, 1987, (96)]. Alternatively, plant promoters such as the 
small subunit of RUBISCO or heat shock promoters can be used [Coruzzi et al., 1984, (97); 
Broglie et al., 1984, (98); Winter et al., 1991, (99)]. These constructs can be introduced into plant 
cells by direct DNA transformation or by pathogen-mediated transfection. Such techniques are 
described in a number of generally available reviews. 

An insect system also can be used to express a ..BREAST CANCER GENE" polypeptide. For 
example, in one such system Autographa califomica nuclear polyhidrosis virus (AcNPV) is used 
as a vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. 
Sequences encoding ..BREAST CANCER GENE" polypeptides can be cloned into a nonessential 
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region of the virus, such as the polyhedrin gene, and placed under control of the polyhedrin 
promoter. Successful insertion of „BREAST CANCER GENE" polypeptides will render the 
polyhedrin gene inactive and produce recombinant virus lacking coat protein. The recombinant 
viruses can then be used to infect S. frugiperda cells or Trichoplusia larvae in which „BREAST 
CANCER GENE" polypeptides can be expressed [Engelhard et al., 1994, (100)]. 

Mammalian Expression Systems 

A number of viral-based expression systems can be used to express ,3REAST CANCER GENE" 
polypeptides in mammalian host cells. For example, if an adenovirus is used as an expression 
vector, sequences encoding ,3REAST CANCER GENE" polypeptides can be ligated into an 
adenovirus transcription/translation complex comprising the late promoter and tripartite leader 
sequence. Insertion in a nonessential El or E3 region of the viral genome can be used to obtain a 
viable virus which is capable of expressing a ,3REAST CANCER GENE" polypeptide in infected 
host cells [Logan & Shenk, 1984, (101)]. If desired, transcription enhancers, such as the Rous 
sarcoma virus (RSV) enhancer, can be used to increase expression in mammalian host cells. 

Human artificial chromosomes (HACs) also can be used to deliver larger fragments of DNA than 
can be contained and expressed in a plasmid. HACs of 6M to 10M are constructed and delivered 
to cells via conventional delivery methods (e.g., liposomes, polycationic amino polymers, or 
vesicles). 

Specific initiation signals also can be used to achieve more efficient translation of sequences 
encoding „BREAST CANCER GENE" polypeptides. Such signals include the ATG initiation 
codon and adjacent sequences. In cases where sequences encoding a ,JBREAST CANCER GENE" 
polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate 
expression vector, no additional transcriptional or translational control signals may be needed. 
However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals (including the ATG initiation codon) should be provided. The 
initiation codon should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons can be of various origins, both natural and 
synthetic. The efficiency of expression can be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used [Scharf et al., 1994, (102)]. 

Host Cells 



A host ceil strain can be chosen for its ability to modulate the expression of the inserted sequences 
or to process the expressed „BREAST CANCER GENE" polypeptide in the desired fashion. Such 
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modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Posttranslational processing which 
cleaves a "prepro" form of the polypeptide also can be used to facilitate correct insertion, folding 
and/or function. Different host cells which have specific cellular machinery and characteristic 
5 mechanisms for Post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are 
available from the American Type Culture Collection (ATCC; 10801 University Boulevard, 
Manassas, VA 201 10-2209) and can be chosen to ensure the correct modification and processing 
of the foreign protein. 

Stable expression is preferred for long-term, high-yield production of recombinant proteins. For 
10 example, cell lines which stably express „BREAST CANCER GENE" polypeptides can be 
transformed using expression vectors which can contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vectd^^ 
Following the introduction of the vector, cells can be allowed to grow for 12 days in an enriched 
. . . medium before they are switched to a selective medium. The purpose of the selectable marker is to 
15 confer resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced ,3REAST CANCER GENE" sequences. Resistant clones of 
stably transformed cells can be proliferated using tissue culture techniques appropriate to the cell 
type [Freshney et al., 1986, (103). 

Any number of selection systems can be used to recover transformed cell lines. These include, but 
20 are not limited to, the herpes simplex virus thymidine kinase (Wigler et al., 1977, (104)] and 
adenine phosphoribosyltransferase [Lowy et al., 1980, (105)] genes which can be employed in tk~ 
or aprf cells, respectively. Also, antimetabolite, antibiotic, or herbicide resistance can be used as 
the basis for selection. For example, dhfir confers resistance to methotrexate [Wigler et al., 19^ 
(106)], npt confers resistance to the aminoglycosides, neomycin and G418 [Colbere-Garapin et al., 
25 1981, (107)], and als and pat confer resistance to chlcrsulfurcn and phosphinotricin 
acetyltransferase, respectively. Additional selectable genes have been described. For example, 
trpB allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize 
histinol in place of histidine [Hartman & Mulligan, 1988 ,(108)]. Visible markers such as 
anthocyanins, B-glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, can 
30 be used to identify transformants and to quantify the amount of transient or stable protein 
expression attributable to a specific vector system [Rhodes et al., 1995, (109)]. 
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Detectine Expression and vene product 

Although the presence of marker gene expression suggests that the .3REAST CANCER GENE 
polynucleotide is also present, its presence and expression may need to be confirmed. Fc 
example, if a sequence encoding a , .BREAST CANCER GENE" polypeptide is inserted within 
marker gene sequence, transformed cells containing sequences which encode a .3REAS' 
CANCER GENE" polypeptide can be identified by the absence of marker gene functior. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding a ..BREAST 
CANCER GENE" polypeptide under the control of a single promoter. Expression of the marke 
gene in response to induction or selection usually indicates expression of the ..BREAST CANCEF 
GENE" polynucleotide. 

Alternatively, host cells which contain a .3REAST CANCER GENE" polynucleotide and whicl 
express a ..BREAST CANCER GENE" polypeptide can be identified by a variety of procedure* 
known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA oi 
DNA-RNA hybridization and protein bioassay or immunoassay techniques which include 
membrane, solution, or chip-based technologies for the detection and/or quantification oi 
polynucleotide or protein. For example, the presence of a polynucleotide sequence encoding a 
.3REAST CANCER GENE" polypeptide can be detected by DNA-DNA or DNA-RNA 
hybridization or amplification using probes or fragments or fragments of polynucleotides encoding 
a ..BREAST CANCER GENE" polypeptide. Nucleic acid amplification-based assays involve the 
use of oligonucleotides selected from sequences encoding a .3REAST CANCER. GENE" 
polypeptide to detect transformants which contain a .3REAST CANCER GENE" polynucleotide. 

A variety of protocols for detecting and measuring the expression of a .3REAST CANCER 
GENE" polypeptide, using either polyclonal or monoclonal antibodies specific for the polypeptide, 
are known in the art Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two-site, 
monoclonal-based immunoassay using monoclonal antibodies reactive to two non-interfering 
epitopes on a J3REAST CANCER GENE" polypeptide can be used, or a competitive binding 
assay can be employed. These and other assays are described in Hampton et al., (1 10) and Maddox 
etal., 111). 

A wide variety of labels and conjugation techniques are known by those skilled in the art and can 
be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding .3REAST CANCER 
GENE" polypeptides include oligo labeling, nick translation, end-labeling, or PCR amplification 
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using a labeled nucleotide. Alternatively, sequences encoding a ..BREAST CANCER GENE" 
polypeptide can be cloned into a vector for the production of an mRNA probe. Such vectors are 
known in the art, are commercially available, and can be used to synthesize RNA probes in vitro 
by addition of labeled nucleotides and an appropriate RNA polymerase such as T7, T3, or SP6. 
These procedures can be conducted using a variety of commercially available kits (Amersham 
Pharmacia Biotech, Promega, and US Biochemical). Suitable reporter molecules or labels which 
can be used for ease of detection include radionuclides, enzymes, and fluorescent, 
chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic 
particles, and the like. 

Expression and Purific ation of Polypeptides 

Host cells transformed with nucleotide sequences encoding a ..BREAST CANCER GENE^ 
polypeptide can be cultured under conditions suitable for the expression and recovery of the 
protein from cell culture. The polypeptide produced by a transformed cell can be secreted or 
stored intracellular depending on the sequence and/or the vector used. As will be understood by 
those of skill in the art, expression vectors containing polynucleotides which encode „BREAST 
CANCER GENE" polypeptides can be designed to contain signal sequences which direct secretion 
of soluble ..BREAST CANCER GENE" polypeptides through a prokaryotic or eukaryotic cell 
membrane or which direct the membrane insertion of membrane-bound ..BREAST CANCER 
GENE" polypeptide. 

As discussed above, other constructions can be used to join a sequence encoding a „BREAST 
CANCER GENE" polypeptide to a nucleotide sequence encoding a polypeptide domain which 
will facilitate purification of soluble proteins. Such purification facilitating domains include, but 
are not limited to, metal chelating peptides such as histidine-tryptophan modules that allW 
purification on immobilized metals, protein A domains that allow purification on immobilized 
immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system 
(Immunex Corp., Seattle, Wash.). Inclusion of cleavable linker sequences such as those specific 
for Factor Xa or enterokinase (Invitrogen, San Diego, CA) between the purification domain and 
the ..BREAST CANCER GENE" polypeptide also can be used to facilitate purification. One such 
expression vector provides for expression of a fusion protein containing a ..BREAST CANCER 
GENE" polypeptide and 6 histidine residues preceding a thioredoxin or an enterokinase cleavage 
site. The histidine residues facilitate purification by DS1AC (immobilized metal ion affinity 
chromatography [Porath et at, 1992, (1 12)], while the enterokinase cleavage site provides a means 
for purifying the .3REAST CANCER GENE" polypeptide from the fusion protein. Vectors 
which contain fusion proteins are disclosed in Kroll et al., (1 13). 
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Chemical Synthesis 

Sequences encoding a „BREAST CANCER GENE" polypeptide can be synthesized, in whole o: 
in part, using chemical methods well known in the art (see Caruthers et al., (1 14) and Horn et al. 
(115). Alternatively, a ,3REAST CANCER GENE" polypeptide itself can be produced usin* 
chemical methods to synthesize its amino acid sequence, such as by direct peptide synthesis usinj 
solid-phase techniques [Merrifield, 1963, (116) and Roberge et al., 1995, (117)]. Protein synthesis 
can be performed using manual techniques or by automation. Automated synthesis can be 
achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer). 
Optionally, fragments of ,JBREAST CANCER GENE" polypeptides can be separately synthesized 
and combined using chemical methods to produce a full-length molecule. 

The newly synthesized peptide can be substantially purified by preparative high performance 
liquid chromatography [Creighton, 1983, (118)]. The composition of a synthetic „BREAST 
CANCER GENE" polypeptide can be confirmed by amino acid analysis or sequencing (e.g., the 
Edman degradation procedure; see Creighton, (118). Additionally, any portion of the amino acid 
sequence of the „BREAST CANCER GENE" polypeptide can be altered during direct synthesis 
and/or combined using chemical methods with sequences from other proteins to produce a variant 
polypeptide or a fusion protein. 

Production of Altered P olypeptides 

As will be understood by those of skill in the art, it may be advantageous to produce „BREAST 
CANCER GENE" polypeptide-encoding nucleotide sequences possessing non-natural occurring 
codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be 
selected to increase the rate of protein expression or to produce an RNA transcript having 
desirable properties, such as a half-life which is longer than that of a transcript generated from the 
naturally occurring sequence. 

The nucleotide sequences disclosed herein can be engineered using methods generally known in 
the art to alter ,3REAST CANCER GENE" polypeptide-encoding sequences for a variety of 
reasons, including but not limited to, alterations which modify the cloning, processing, and/or 
expression of the polypeptide or mRNA product. DNA shuffling by random fragmentation and 
PCR re-assembly of gene fragments and synthetic oligonucleotides can be used to engineer the 
nucleotide sequences. For example, site-directed mutagenesis can be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, introduce 
mutations, and so forth. 
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Predictive. Diagnostic and Prognostic Assays 

The present invention provides method for determining whether a subject is at risk for developing 
malignant neoplasia and breast cancer in particular by detecting one of the disclosed 
polynucleotide markers comprising any of the polynucleotides sequences of the SEQ ID NO: 2 to 
6, 8, 9, 1 1 to 16, 18, 19 or 21 to 26 or 53 to 75 and/or the polypeptide markers encoded thereby or 
polypeptide markers comprising any of the polypeptide sequences of the SEQ ID NO: 28 to 32, 
34, 35, 37 to 42, 44, 45 or 47 to 52 or 76 to 98 or at least 2 of the disclosed polynucleotides 
selected from SEQ ID NO: I to 26 and 53 to 75 or the at least 2 of the disclosed polypeptides 
selected from SEQ ID NO: 28 to 32 and 76 to 98 for malignant neoplasia and breast cancer in 
particular. 

In clinical applications, biological samples can be screened for the presence and/or absence of tj^ 
biomarkers identified herein. Such samples are for example needle biopsy cores, surgical resection 
samples, or body fluids like serum, thin needle nipple aspirates and urine. For example, these 
methods include obtaining a biopsy, which is optionally fractionated by cryostat sectioning to 
enrich diseases cells to about 80% of the total cell population. In certain embodiments, 
polynucleotides extracted from these samples may be amplified using techniques well known in 
the art. The expression levels of selected markers detected would be compared with statistically 
valid groups of diseased and healthy samples. 

In one embodiment the diagnostic method comprises determining whether a subject has an 
abnormal mRNA and/or protein level of the disclosed markers, such as by Northern blot analysis, 
reverse transcription-polymerase chain reaction (RT-PCR), in situ hybridization, 
immunoprecipitation, Western blot hybridization, or immunohistochemistry. According to the 
method, cells are obtained from a subject and the levels of the disclosed biomarkers, protein^r 
mRNA level, is determined and compared to the level of these markers in a healthy subject An 
abnormal level of the biomarker polypeptide or mRNA levels is likely to be indicative of 
malignant neoplasia such as breast cancer. 

In another embodiment the diagnostic method comprises determining whether a subject has an 
abnormal DNA content of said genes or said genomic loci, such as by Southern blot analysis, dot 
blot analysis, fluorescence or colorimetric In Situ hybridization, comparative genomic 
hybridization, genotpying by VNTR, STS-PCR or quantitative PCR. In general these assays 
comprise the usage of probes from representative genomic regions. The probes contain at least 
parts of said genomic regions or sequences complementary or analogous to said regions. In 
particular intra- or intergenic regions of said genes or genomic regions. The probes can consist of 



BHC03 1 051 



-79- 

nucleotide sequences or sequences of analogous functions (e.g. PNAs, Morpholino oligomers) 
being able to bind to target regions by hybridization. In general genomic regions being altered in 
said patient samples are compared with unaffected control samples (normal tissue from the same 
or different patients, surrounding unaffected tissue, peripheral blood) or with genomic regions of 
the same sample that don't have said alterations and can therefore serve as internal controls. In a 
preferred embodiment regions located on the same chromosome are used. Alternatively, 
gonosomal regions and /or regions with defined varying amount in the sample are used. In one 
favored embodiment the DNA content, structure, composition or modification is compared that lie 
within distinct genomic regions. Especially favored are methods that detect the DNA content of 
said samples, where the amount of target regions are altered by amplification and or deletions. In 
another embodiment the target regions are analyzed for the presence of polymorphisms (e.g. 
Single Nucleotide Polymorphisms or mutations) that affect or predispose the cells in said samples 
with regard to clinical aspects, being of diagnostic, prognostic or therapeutic value. Preferably, the 
identification of sequence variations is used to define haplotypes that result in characteristic 
behavior of said samples with said clinical aspects. 

The following examples of genes in 17q 12-2 1.2 are offered by way of illustration, not by way of 
limitation. 

One embodiment of the invention is a method for the prediction, diagnosis or prognosis of 
malignant neoplasia by the detection of at least 10, at least 5, or at least 4, or at least 3 and more 
preferably at least 2 markers whereby the markers are genes and fragments thereof and/or genomic 
nucleic acid sequences that are located on one chromosomal region which is altered in malignant 
neoplasia. 

One further embodiment of the invention is method for the prediction, diagnosis or prognosis of 
malignant neoplasia by the detection of at least 10, at least 5, or at least 4, or at least 3 and more 
preferably at least 2 markers whereby the markers (a) are genes and fragments thereof and/or 
genomic nucleic acid sequences that are located on one or more chromosomal region(s) which 
is/are altered in malignant neoplasia and (b) functionally interact as (i) receptor and ligand or (ii) 
members of the same signal transduction pathway or (iii)members of synergistic signal 
transduction pathways or (iv) members of antagonistic signal transduction pathways or (v) 
transcription factor and transcription factor binding site. 

In one embodiment, the method for the prediction, diagnosis or prognosis of malignant neoplasia 
and breast cancer in particular is done by the detection of: 
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(a) polynucleotide selected from the polynucleotides of the SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 
18, 19, 21 to 26 or 53 to 75; 

(b) a polynucleotide which hybridizes under stringent conditions to a polynucleotide specified 
in (a) encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3; 

(c) a polynucleotide the sequence of which deviates from the polynucleotide specified in (a) 
and (b) due to the generation of the genetic code encoding a polypeptide exhibiting the 
same biological function as specified for the respective sequence in Table 2 or 3; 

(d) a polynucleotide which represents a specific fragment, derivative or allelic variation of a 
polynucleotide sequence specified in (a) to (c); 

in a biological sample comprising the following steps: hybridizing any polynucleotide or 
. analogous oligomer, specified in (a) to (do) to a polynucleotide material of a biological sample, 
thereby forming a hybridization complex; and detecting said hybridization complex. 

In another embodiment the method for the prediction, diagnosis or prognosis of malignant 
neoplasia is done as just described but, wherein before hybridization, the polynucleotide material 
of the biological sample is amplified. 

In another embodiment the method for the diagnosis or prognosis of malignant neoplasia and 
breast cancer in particular is done by the detection of: 

(a) a polynucleotide selected from the polynucleotides of the SEQ ID NO: 2 to 6, 8, 9, 11 to 
16, 18, 19, 21 to 26 or 53 to 75; 0 

(b) a polynucleotide which hybridizes under stringent conditions to a polynucleotide specified 
in (a) encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3; 

(c) a polynucleotide the sequence of which deviates from the polynucleotide specified in (a) 
and (b) due to the generation of the genetic code encoding a polypeptide exhibiting the 
same biological function as specified for the respective sequence in Table 2 or 3; 

(d) a polynucleotide which represents a specific fragment, derivative or allelic variation of a 
polynucleotide sequence specified in (a) to (c); 

(e) a polypeptide encoded by a polynucleotide sequence specified in (a) to (d) 
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(0 a polypeptide comprising any polypeptide of SEQ ID NO: 28 to 32, 34, 35, 37 to 42, < 
45, 47 to 52 or 76 to 98; 

comprising the steps of contacting a biological sample with a reagent which specifically interac 
with the polynucleotide specified in (a) to (d) or the polypeptide specified in (e). 

DNA array tprhnnlr.^ , 

In one embodiment, the present Invention also provides a method wherein polynucleotide prob, 
are immobilized an a DNA chip in an organized array. Oligonucleotides can be bound to a soli 
Support by a variety of processes, including lithography. For example a chip can hold up 1 
4100,00 oligonucleotides (GeneChip, Affymetrix). The present invention provides signifies 
advantages over the available tests for malignant neoplasia, such as breast cancer, because 
increases the reliability of the test by providing an array of polynucleotide markers an a singl 
chip. 

The method includes obtaining a biopsy of an affected person, which is optionally fractionated fr 
cryostat sectioning to enrich diseased cells to about 80% of the total cell population and the use o 
body fluids such as serum or urine, serum or cell containing liquids (e.g. derived from fine needh 
aspnrates). The DNA or RNA is then extracted, amplified, and analyzed with a DNA chip t< 
determine the presence of absence of the marker polynucleotide sequences. In one embodiment 
the polynucleotide probes are spotted onto a substrate in a two-dimensional matrix or arrayi 
samples of polynucleotides can be labeled and then hybridized to the probes. Double-stranded 
polynucleotides, comprising the labeled sample polynucleotides bound to probe polynucleotides, 
can be detected once the unbound portion of the sample is washed away. 

The probe polynucleotides can be spotted an substrates including glass, nitrocellulose, etc The 
probes can be bound to the Substrate by either covalent bonds or by non-specific interactions, such 
as hydrophobic interactions. The sample polynucleotides can be labeled using radioactive labels 
fluorophores, chromophores, etc. Techniques for constructing arrays and methods of using these 
arrays are described in EP 0 799 897; WO 97/29212; WO 97/27317; EP 0 785 280; WO 97/02357- 
U.S. Pat No. 5,593,839; U.S. Pat. No. 5,578,832; EP 0 728 520; U.S. Pat. No. 5,599,695; EP 0 721 
016; U.S. Pat. No. 5,556,752; WO 95/22058; and U.S. Pat No. 5,631,734. Further, arrays can be 
used to examine differential expression of genes and can be used to determine gene function. For 
example, arrays of the instant polynucleotide sequences can be used to determine if any of the 
polynucleotide sequences are differentially expressed between normal cells and diseased cells, for 
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example. High expression of a particular message in a diseased sample, which is not observed in a 
corresponding normal sample, can indicate a breast cancer specific protein. 

Accordingly, in one aspect, the invention provides probes and primers that are specific to the 
unique polynucleotide markers disclosed herein. 

In one embodiment, the method comprises using a polynucleotide probe to determine the presence 
of malignant or breast cancer cells in particular in a tissue from a patient. Specifically, the method 
.. comprises: 

1) providing a polynucleotide probe comprising a nucleotide sequence at least 12 nucleotides 
in length, preferably at least 15 nucleotides, more preferably, 25 nucleotides, and most 
preferably at least 40 nucleotides, and up to all or nearly all of the coding sequence which 
is complementary to a portion of the coding sequence of a polynucleotide selected fronP 
the polynucleotides of SEQ ID NO: 1 to 26 and 53 to 75 or a sequence complementary 
thereto and is 

2) differentially expressed in malignant neoplasia, such as breast cancer; 

3) obtaining a tissue sample from a patient with malignant neoplasia; 

4) providing a second tissue sample from a patient with no malignant neoplasia; 

5) contacting the polynucleotide probe under stringent conditions with RNA of each of said 
first and second tissue samples (e.g., in a Northern blot or in situ hybridization assay); and 

6) comparing (a) the amount of hybridization of the probe with RNA of the first tiss|| 
sample, with (b) the amount of hybridization of the probe with RNA of the second tissue 





oanipi*-, 



wherein a statistically significant difference in the amount of hybridization with the RNA of the 
first tissue sample as compared to the amount of hybridization with the RNA of the second tissue 
sample is indicative of malignant neoplasia and breast cancer in particular in the first tissue 
sample. 



Data analysis methods 



Comparison of the expression levels of one or more "BREAST CANCER GENES" with reference 
expression levels, e.g., expression levels in diseased cells of breast cancer or in normal counterpart 
cells, is preferably conducted using computer systems. In one embodiment, expression levels are 
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obtained in two cells and these two sets of expression levels are introduced into a computer systei 
for comparison. In a preferred embodiment, one set of expression levels is entered into a compuu 
system for comparison with values that are already present in the computer system, or i 
computer-readable form that is then entered into the computer system. 

In one embodiment, the invention provides a computer readable form of the gene expressio 
profile data of the invention, or of values corresponding to the level of expression of at least on 
"BREAST CANCER GENE" in a diseased cell. The values can be mRNA expression level 
obtained from experiments, e.g., microarray analysis. The values can also be mRNA level 
normalised relative to a reference gene whose expression is constant in numerous cells unde 
numerous conditions, e.g., GAPDH. In other embodiments, the values in the computer are ratio: 
of, or differences between, normalized or non-normalized mRNA levels in different samples. 

The gene expression profile data can be in the form of a table, such as an Excel table. The data car 
be alone, or it can be part of a larger database, e.g., comprising other expression profiles. Foi 
example, the expression profile data of the invention can be part of a public database. The 
computer readable form can be in a computer. In another embodiment, the invention provides a 
computer displaying the gene expression profile data. 

In one embodiment, the invention provides a method for determining the similarity between the 
level of expression of one or more "BREAST CANCER GENES" m a first cell, e.g., a cell of a 
subject, and that in a second cell, comprising obtaining the level of expression of one or more 
"BREAST CANCER GENES" in a first cell and entering these values into a computer comprising 
a database including records comprising values corresponding to levels of expression of one or 
more "BREAST CANCER GENES" in a second cell, and processor instructions, e.g., a user 
interface, capable of receiving a selection of one or more values for comparison purposes with 
data that is stored in the computer. The computer may further comprise a means for converting the 
comparison data into a diagram or chart or other type of output. 

In another embodiment, values representing expression levels of "BREAST CANCER GENES" 
are entered into a computer system, comprising one or more databases with reference expression 
levels obtained from more than one cell. For example, the computer comprises expression data of 
diseased and normal cells. Instructions are provided to the computer, and the computer is capable 
of comparing the data entered with the data in the computer to determine whether the data entered 
is more similar to that of a normal cell or of a diseased cell. 
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In another embodiment, the computer comprises values of expression levels in cells of subjects at 
different stages of breast cancer, and the computer is capable of comparing expression data entered 
into the computer with the data stored, and produce results indicating to which of the expression 
profiles in the computer, the one entered is most similar, such as to determine the stage of breast 
cancer in the subject. 

In yet another embodiment, the reference expression profiles in the computer are expression 
profiles from cells of breast cancer of one or more subjects, which cells are treated in vivo or in 
vitro with a drug used for therapy of breast cancer. Upon entering of expression data of a cell of a 
subject treated in vitro or in vivo with the drug, the computer is instructed to compare the data 
entered to the data in the computer, and to provide results indicating whether the expression data 
input into the computer are more similar to those of a cell of a subject that is responsive to the 
drug or more similar to those of a cell of a subject that is not responsive to the drug. Thus, tl£ 
results indicate whether the subject is likely to respond to the treatment with the drug or unlikely 
to respond to it. - 

In one embodiment, the invention provides a system mat comprises a means for receiving gene 
expression data for one or a plurality of genes; a means for comparing the gene expression data 
from each of said one or plurality of genes to a common reference frame; and a means for 
presenting the results of the comparison. This system may further comprise a means for clustering 
the data. 

In another embodiment, the invention provides a computer program for analyzing gene expression 
data comprising (i) a computer code that receives as input gene expression data for a plurality of 
genes and (ii) a computer code that compares said gene expression data from each of said pluraltt 
of genes to a common reference frame. 

The invention also provides a machine-readable or computer-readable medium including program 
instructions for performing the following steps: (i) comparing a plurality of values corresponding 
to expression levels of one or more genes characteristic of breast cancer in a query cell with a 
database including records comprising reference expression or expression profile data of one or 
more reference cells and an annotation of the type of cell; and (ii) indicating to which cell the 
query cell is most similar based on similarities of expression profiles. The reference cells can be 
cells from subjects at different stages of breast cancer. The reference cells can also be cells from 
subjects responding or not responding to a particular drug treatment and optionally incubated in 
vitro or in vivo with the drug. 
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The reference cells may also be cells from subjects responding or not responding to seve 
Afferent treatments, and the computer system indicates a preferred treatment for the subje 
Accordingly, the invention provides a method for selecting a therapy for a patient having bre S 
cancer, the method comprising: (i) providing the level of expression of one or more gen 
characteristic of breast cancer in a diseased cell of the patient; (ii) providing a plurality 
reference profiles, each associated with a therapy, wherein the subject expression profile and ea< 
reference profile has a plurality of values, each value representing the level of expression of a ge, 
characteristic of breast cancer; and (Hi) selecting the reference profile most similar to the subje 
expression profile, to thereby select a therapy for said patient. In a preferred embodiment step (ii 
is performed by a computer. The most similar reference profile may be selected by weighing 
comparison value of the plurality using a weight value associated with the correspond* 
expression data. 

The relative abundance of an mRNA in two biological samples can be scored as a perturbation an 
* magnitude determined (i.e., the abundance is different in the two sources of mRNA tested) o 
as not perturbed (i.e., the relative abundance is the same). In various embodiments, a differeno 
between the two sources of RNA of at least a factor of about 25% (RNA from one source is 25°, 
more abundant in one source than the other source), more usually about 50%, even more often * 
a factor of about 2 (twice as abundant), 3 (three times as abundant) or 5 (five times as abundant) i' 
scored as a perturbation. Perturbations can be used by a computer for calculating and expressior 
comparisons. 

Preferably, in addition to identifying a perturbation as positive or negative, it is advantageous to 
determine the magnitude of the perturbation. This can be carried out, as noted above, by 
calculating the ratio of the emission of the two fluorophores used for differential labeling, or by 
analogous methods that will be readily apparent to those of skill in the art. 

The computer readable medium may further comprise a pointer to a descriptor of a stage of breast 
cancer or to a treatment for breast cancer. 

In operation, the means for receiving gene expression data, the means for comparing the gene 
expression data, the means for presenting, the means for normalizing, and the means for clustering 
within the context of the systems of the present invention can involve a programmed computer 
with the respective functionalities described herein, implemented in hardware or hardware and 
software; a logic circuit or other component of a programmed computer that performs the 
operations specifically identified herein, dictated by a computer program; or a computer memory 
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encoded with executable instructions representing a computer program that can cause a computer 
to function in the particular fashion described herein. 

Those skilled in the art will understand that the systems and methods of the present invention may 
be applied to a variety of systems, including IBM-compatible personal computers running MS- 
DOS or Microsoft Windows. 

The computer may have internal components linked to external components. The internal 
components may include a processor element interconnected with a main memory. The computer 
system can be an Intel Pentium®-based processor of 200 MHz or greater clock rate and with 32 
MB or more of main memory. The external component may comprise a mass storage, which can 
be one or more hard disks (which are typically packaged together with the processor and memory). 
Such hard disks are typically of 1 GB or greater storage capacity. Other external componen^ 
include a user interface device, which can be a monitor, together with an inputing device, which 
can be a "mouse", or other graphic input devices, and/or a keyboard. A printing device can also be 
attached to the computer. 

Typically, the computer system is also linked to a network link, which can be part of an Ethernet 
link to other local computer systems, remote computer systems, or wide area communication 
networks, such as the Internet. This network link allows the computer system to share data and 
processing tasks with other computer systems. 

Loaded into memory during operation of this system are several software components, which are 
both standard in the art and special to the instant invention. These software components 
collectively cause the computer system to function according to the methods of this invention. 
These software components are typically stored on a mass storage. A software compon<0 
represents the operating system, which is responsible for managing the computer system and its 
network interconnections. This operating system can be, for example, of the Mtcrosoft Windows 
family, such as Windows 95, Windows 98, or Windows NT. A software component represents 
common languages and functions conveniently present on this system to assist programs 
implementing the methods specific to this invention. Many high or low level computer languages 
can be used to program the analytic methods of this invention. Instructions can be interpreted 
during run-time or compiled. Preferred languages include C/C4+, and JAVA*. Most preferably, 
the methods of this invention are programmed in mathematical software packages which allow 
symbolic entry of equations and high-level specification of processing, including algorithms to be 
used, thereby freeing a user of the need to procedurally program individual equations or 
algorithms. Such packages include Matlab from Mathworks (Natick, Mass.), Mathematica from 
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Wolfram Research (Champaign, III.), or S-Plus from Math Soft (Cambridge, Mass.). Accordingly 
a software component represents the analytic methods of this invention as programmed in 
procedural language or symbolic package. In a preferred embodiment, the computer system als 
contains a database comprising values representing levels of expression of one or more gene 
characteristic of breast cancer. The database may contain one or more expression profiles of gene 
characteristic of breast cancer in different cells. 

In an exemplary implementation, to practice the methods of the present invention, a user firs 
loads expression profile data into the computer system. These data can be directly entered by th. 
user from a monitor and keyboard, or from other computer systems linked by a networl 
connection, or on removable storage media such as a CD-ROM or floppy disk or through th< 
network. Next the user causes execution of expression profile analysis software which performs 
the steps of comparing and, e.g., clustering co-varying genes into groups of genes. 

In another exemplary implementation, expression profiles are compared using a method describee 
in U.S. Patent No. 6,203,987. A user first loads expression profile data into the computer system. 
Geneset profile definitions are loaded into the memory from the storage media or from a remote 
computer, preferably from a dynamic geneset database system, through the network. Next the user 
causes execution of projection software which performs the steps of converting expression profile 
to projected expression profiles. The projected expression profiles are then displayed. 

In yet another exemplary implementation, a user first leads a projected profile into the memory. 
The user then causes the loading of a reference profile into the memory. Next, the user causes the 
execution of comparison software which performs the steps of objectively comparing the profiles. 

Detection of variant polynucleotide sequence 

In yet another embodiment, the invention provides methods for determining whether a subject is at 
risk for developing a disease, such as a predisposition to develop malignant neoplasia, for example 
breast cancer, associated with an aberrant activity of any one of the polypeptides encoded by any 
of the polynucleotides of the SEQ ID NO: 1 to 26 or 53 to 75, wherein the aberrant activity of the 
polypeptide is characterized by detecting the presence or absence of a genetic lesion characterized 
by at least one of these: 

(i) an alteration affecting the integrity of a gene encoding a marker polypeptides, or 

(ii) the misexpression of the encoding polynucleotide. 
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To illustrate, such genetic lesions can be detected by ascertaining the existence of at least one of 
these: 

I. a deletion of one or more nucleotides from the polynucleotide sequence 
EL an addition of one or more nucleotides to the polynucleotide sequence 

III. a substitution of one or more nucleotides of the polynucleotide sequence 

IV. a gross chromosomal rearrangement of the polynucleotide sequence 

V. a gross alteration in the level of a messenger RNA transcript of the polynucleotide 
sequence 

VI. aberrant modification of the polynucleotide sequence, such as of the methylation pattei^| 
of the genomic DNA 

VII. the presence of a non-wild type splicing pattern of a messenger KNA transcript of the gene 

VIII. a non-wild type level of the marker polypeptide 

IX. allelic loss of the gene 

X. allelic gain of the gene 

XI. inappropriate post-translational modification of the marker polypeptide 

The present Invention provides assay techniques for detecting mutations in the encoding 
polynucleotide sequence. These methods include, but are not limited to, methods involv|p 
sequence analysis, Southern blot hybridization, restriction enzyme site mapping, and methods 
involving detection of absence of nucleotide pairing . between the polynucleotide to be analyzed 
and a probe. 

Specific diseases or disorders, e.g., genetic diseases or disorders, are associated with specific 
allelic variants of polymorphic regions of certain genes, which do not necessarily encode a 
mutated protein. Thus, the presence of a specific allelic variant of a polymorphic region of a gene 
in a subject can render the subject susceptible to developing a specific disease or disorder. 
Polymorphic regions in genes, can be identified, by determining the nucleotide sequence of genes 
in populations of individuals. If a polymorphic region is identified, then the link with a specific 
disease can be determined by studying specific populations of individuals, e.g. individuals which 
developed a specific disease, such as breast cancer. A polymorphic region can be located in any 
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region of a gene, e.g., exons, in coding or non coding regions of exons, introns, and promote 
region. 

In an exemplary embodiment, there is provided a polynucleotide composition comprising s 
polynucleotide probe including a region of nucleotide sequence which is capable of hybridising t< 
a sense or antisense sequence of a gene or naturally occurring mutants thereof, or 5' or 3' flankm* 
sequences or intronic sequences naturally associated with the subject genes or naturally occurrinf 
mutants thereof. The polynucleotide of a cell is rendered accessible for hybridization, the probe i: 
contacted with the polynucleotide of the sample, and the hybridization of the probe to the sample 
polynucleotide is detected. Such techniques can be used to detect lesions or allelic variants a( 
either the genomic or mRNA level, including deletions, substitutions, etc., as well as to determine 
mRNA transcript levels. 

A preferred detection method is allele specific hybridization using probes overlapping the 
mutation or polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around the 
mutation or polymorphic region. In a preferred embodiment of the invention, several probes 
capable of hybridising specifically to allelic variants are attached to a solid phase support, e.g., a 
"chip". Mutation detection analysis using these chips comprising oligonucleotides, also termed 
"DNA probe arrays" is described e.g., in Cronin et al. (1 19). In one embodiment, a chip comprises 
all the allelic variants of at least one polymorphic region of a gene. The solid phase support is then 
contacted with a test polynucleotide and hybridization to the specific probes is detected. 
Accordingly, the identity of numerous allelic variants of one or more genes can be identified in a 
simple hybridization experiment 

In certain embodiments, detection of the lesion comprises utilizing the probe/primer in a 
polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PCR or RACE PCR, or, alternatively, in a ligase chain reaction (LCR) [Landegran et al., 
1988, (120) and Nakazawa et al., 1994 (121)], the latter of which can be particularly useful for 
detecting point mutations in the gene; Abravaya et al., 1995 ,(122)]. In a merely illustrative 
embodiment, the method includes the steps of (i) collecting a sample of cells from a patient, (ii) 
isolating polynucleotide (e.g., genomic, mRNA or both) from the cells of the sample, (iii) 
contacting the polynucleotide sample with one or more primers which specifically hybridize to a 
polynucleotide sequence under conditions such that hybridization and amplification of the 
polynucleotide (if present) occurs, and (iv) detecting the presence or absence of an amplification 
product, or detecting the size of the amplification product and comparing the length to a control 
sample. It is anticipated that PCR and/or LCR may be desirable to use as a preliminary 
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amplification step in conjunction with any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self sustained sequence replication [Guatelli, J.C. et al., 
1990, (123)], transcriptional amplification system [Kwoh, D.Y. et al., 1989, (124)), Q-Beta 
replicase [Lizardi, P.M. et al., 1988 ,(125)], or any other polynucleotide amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of skill 
in the art. These detection schemes are especially useful for the detection of polynucleotide 
molecules if such molecules are present in very low numbers 

In a preferred embodiment of the subject assay, mutations in, or allelic variants, of a gene from a 
sample cell are identified by alterations in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified (optionally), digested with one or more restricti^ 
endonucleases, and fragment length sizes are determined by gel electrophoresis. Moreover; the use 
of sequence specific ribozymes (see, for example, U.S. Patent No. 5,498,531) can be used to score 
for the presence of specific mutations by development or loss of a ribozyme cleavage site. 

In situ hybridization 

In one aspect, the method comprises in situ hybridization with a probe derived from a given 
marker polynucleotide, which sequence is selected from any of the polynucleotide sequences of 
the SEQ ID NO: 1 to 9, or 1 1 to 19 or 21 to 26 and 53 to 75 or a sequence complementary thereto. 
The method comprises contacting the labeled hybridization probe with a sample of a given type of 
tissue from a patient potentially having malignant neoplasia and breast cancer in particular as well 
as normal tissue from a person with no malignant neoplasia, and determining whether the probe 
labels tissue of the patient to a degree significantly different (e.g., by at least a factor of two, 
least a factor of five, or at least a factor of twenty, or at least a factor of fifty) than the degree to 
which normal tissue is labelled. 

Polypeptide detection 

The subject invention further provides a method of determining whether a cell sample obtained 
from a subject possesses an abnormal amount of marker polypeptide which comprises (a) 
obtaining a cell sample from the subject, (b) quantitatively determining the amount of the marker 
polypeptide in the sample so obtained, and (c) comparing the amount of the marker polypeptide so 
determined with a known standard, so as to thereby determine whether the cell sample obtained 
from the subject possesses an abnormal amount of the marker polypeptide. Such marker 
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polypeptides may be detected by immunohistochemical assays, dot-blot assays, ELISA and th 
like. 

Antibodies 

Any type of antibody known in the art can be generated to bind specifically to an epitope of i 
,3REAST CANCER GENE" polypeptide. An antibody as used herein includes intact immuno 
globulin molecules, as well as fragments thereof, such as Fab, F(ab) 2 , and Fv, which are capable o: 
binding an epitope of a ,3REAST CANCER GENE" polypeptide. Typically, at least 6, 8, 10, oi 
12 contiguous amino acids are required to form an epitope. However, epitopes which involve non- 
contiguous amino acids may require more, e.g., at least 15, 25, or 50 amino acids. 

An antibody which specifically binds to an epitope of a „BREAST CANCER GENE" polypeptide 
can be used therapeutically, as well as in immunochemical assays, such as Western blots, ELISAs, 
radioimmunoassays, immunohistochemical assays, immunoprecipitations, or other 
immunochemical assays known in the art. Various immunoassays can be used to identify 
antibodies having the desired specificity. Numerous protocols for competitive binding or 
immunoradiometric assays are well known in the art. Such immunoassays typically involve the 
measurement of complex formation between an immunogen and an antibody which specifically 
binds to the immunogen. 

Typically, an antibody which specifically binds to a .JBREAST CANCER GENE" polypeptide 
provides a detection signal at least 5-, 10-, or 20-fold higher than a detection signal provided with 
other proteins when used in an immunochemical assay. Preferably, antibodies which specifically 
bind to .3REAST CANCER GENE" polypeptides do not detect other proteins in 
immunochemical assays and can immunoprecipitate a ,3REAST CANCER GENE" polypeptide 
from solution. 

„BREAST CANCER GENE" polypeptides can be used to immunize a mammal, such as a mouse, 
rat, rabbit, guinea pig, monkey, or human, to produce polyclonal antibodies. If desired, a 
.3REAST CANCER GENE" polypeptide can be conjugated to a carrier protein, such as bovine 
serum albumin, thyroglobulin, and keyhole limpet hemocyanin. Depending on the host species, 
various adjuvants can be used to increase the immunological response. Such adjuvants include, 
but are not limited to, Freund's adjuvant, mineral gels (e.g., aluminum hydroxide), and surface 
active substances (e.g. lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole 
limpet hemocyanin, and dinitrophenol). Among adjuvants used in humans, BCG (bacilli 
Calmette-Guerin) and Corynebacterium parvum are especially useful. 
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Monoclonal antibodies which specifically bind to a „BREAST CANCER GENE" polypeptide can 
be prepared using any technique which provides for the production of antibody molecules by 
continuous cell lines in culture. These techniques include, but are not limited to, the hybridoma 
technique, the human B cell hybridoma technique, and the EBV hybridoma technique [Kohler et 
al., 1985, (136); Kozbor et al., 1985, (137); Cote et al., 1983, (138) and Cole et al., 1984, (139)]. 

In addition, techniques developed for the production of chimeric antibodies, the splicing of mouse 
antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity 
and biological activity, can be used [Morrison et al., 1984, (140); Neuberger et al., 1984, 041); 
Takeda et al., 1985, (142)]. Monoclonal and other antibodies also can be humanized to prevent a 
patient from mounting an immune response against the antibody when it is used therapeutically. 
Such antibodies may be sufficiently similar in sequence to human antibodies to be used directly in 
therapy or may require alteration of a few key residues. Sequence differences between rode^ 
antibodies and human sequences can be minimized by replacing residues which differ from those 
in the human sequences by site directed mutagenesis of individual residues or by grating of entire 
complementarity determining regions. Alternatively, humanized antibodies can be produced using 
recombinant methods, as described in GB2188638B. Antibodies which specifically bind to a 
„BREAST CANCER GENE" polypeptide can contain antigen binding sites which are either 
partially or fully humanized, as disclosed in U.S. Patent 5,565,332. 

Alternatively, techniques described for the production of single chain antibodies can be adapted 
using methods known in the art to produce single chain antibodies which specifically bind to 
,3REAST CANCER GENE" polypeptides. Antibodies with related specificity, but of distinct 
idiotypic composition, can be generated by chain shuffling from random combinatorial 
immunoglobulin libraries [Burton, 1991, (143)]. ^ 

Single-chain antibodies also can be constructed using a DNA amplification method, such as PCR, 
using hybridoma cDNA as a template [Thirion et al., 1996, (144)]. Single-chain antibodies can be 
mono- or bispecific, and can be bivalent or tetravalent. Construction of tetravalent, bispecific 
single-chain antibodies is taught, for example, in Coloma & Morrison, (145). Construction of 
bivalent, bispecific single-chain antibodies is taught in Mallender & Voss, (146). 

A nucleotide sequence encoding a single-chain antibody can be constructed using manual or 
automated nucleotide synthesis, cloned into an expression construct using standard recombinant 
DNA methods, and introduced into a cell to express the coding sequence, as described below. 
Alternatively, single-chain antibodies can be produced directly using, for example, filamentous 
phage technology [Verhaar et al., 1995, (147); Nicholls et al., 1993, (148)]. 



BHC03 1 051 



- 93 - 

Antibodies which specifically bind to .3REAST CANCER GENE" polypeptides also can b 
produced by inducing in vivo production in the lymphocyte population or by screeninj 
immunoglobulin libraries or panels of highly specific binding reagents as disclosed in th. 
literature [Orlandi et al., 1989, (149) and Winter et ah, 1991, (150)]. 

Other types of antibodies can be constructed and used therapeutically in methods of the invention 
For example, chimeric antibodies can be constructed as disclosed in WO 93/03151. Bindinj 
proteins which are derived from immunoglobulins and which are multivalent and multispecific 
such as the antibodies described in WO 94/13804, also can be prepared. 

Antibodies according to the invention can be purified by methods well known in the art. Foi 
example, antibodies can be affinity purified by passage over a column to which a .JBREASl 
CANCER GENE" polypeptide is bound. The bound antibodies can then be eluted from the columr 
using a buffer with a high salt concentration. 

Immunoassays are commonly used to quantify the levels of proteins in cell samples, and many 
other immunoassay techniques are known in the art. The invention is not limited to a particular 
assay procedure, and therefore is intended to include both homogeneous and heterogeneous 
procedures. Exemplary immunoassays which can be conducted according to the invention include 
fluorescence polarisation immunoassay (FPIA), fluorescence immunoassay (FIA), enzyme 
immunoassay (EIA), nephelometric inhibition immunoassay (NIA), enzyme \ linked 
immunosorbent assay (ELISA), and radioimmunoassay (RIA). An indicator moiety, or label 
group, can be attached to the subject antibodies and is selected so as to meet the needs of various 
uses of the method which are often dictated by the availability of assay equipment and compatible 
immunoassay procedures. General techniques to be used in performing the various immunoassays 
noted above are known to those of ordinary skill in the art. 

In another embodiment, the level of at least one product encoded by any of the polynucleotide 
sequences of the SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19 or 21 to 26 or 53 to 75 or of at least 2 
products encoded by a polynucleotide selected from SEQ ID NO: 1 to 26 and 53 to 75 or a 
sequence complementary thereto, in a biological fluid (e.g., blood or urine) of a patient may be 
determined as a way of monitoring the level of expression of the marker polynucleotide sequence 
in cells of that patient. Such a method would include the steps of obtaining a sample of a 
biological fluid from the patient, contacting the sample (or proteins from the sample) with an 
antibody specific for a encoded marker polypeptide, and determining the amount of immune 
complex formation by the antibody, with the amount of immune complex formation being 
indicative of the level of the marker encoded product in the sample. This determination is 
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particularly instructive when compared to the amount of immune complex formation by the same 
antibody in a control sample taken from a normal individual or in one or more samples previously 
or subsequently obtained from the same person. 

In another embodiment, the method can be used to determine the amount of marker polypeptide 
present in a cell, which in turn can be correlated with progression of the disorder, e.g., plaque 
formation. The level of the marker polypeptide can be used predictively to evaluate whether a 
sample of cells contains cells which are, or are predisposed towards becoming, plaque associated 
cells. The observation of marker polypeptide level can be utilized in decisions regarding, e.g., the 
use of more stringent therapies. 

As set out above, one aspect of the present invention relates to diagnostic assays for determining, 
in the context of cells isolated from a patient, if the level of a marker polypeptide is significan^ 
reduced in the sample cells. The term "significantly reduced" refers to a cell phenotype wherein 
the cell possesses a reduced cellular amount of the marker polypeptide relative to a normal cell of 
similar tissue origin. For example, a cell may have less than about 50%, 25%, 10%, or 5% of the 
marker polypeptide that a normal control cell. In particular, the assay evaluates the level of marker 
polypeptide in the test cells, and, preferably, compares the measured level with marker 
polypeptide detected in at least one control cell, e.g., a normal cell and/or a transformed cell of 
known phenotype. 

Of particular importance to the subject invention is the ability to quantify the level of marker 
polypeptide as determined by the number of cells associated with a normal or abnormal marker 
polypeptide level. The number of cells with a particular marker polypeptide phenotype may then 
be correlated with patient prognosis. In one embodiment of the invention, the marker polypept^ 
phenotype of the lesion is determined as a percentage of cells in a biopsy which are found to h^ 
abnormally high/low levels of the marker polypeptide. Such expression may be detected by 
immunohistochemical assays, dot-blot assays, ELISA and the like. 

Immunohistochemistrv 

Where tissue samples are employed, immunohistochemical staining may be used to determine the 
number of cells having the marker polypeptide phenotype. For such staining, a multiblock of 
tissue is taken from the biopsy or other tissue sample and subjected to proteolytic hydrolysis, 
employing such agents as protease K or pepsin. In certain embodiments, it may be desirable to 
isolate a nuclear fraction from the sample cells and detect the level of the marker polypeptide in 
the nuclear fraction. 
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The tissues samples are fixed by treatment with a reagent such as formalin, glutaraldehy, 
methanol, or the like. The samples are then incubated with an antibody, preferably a monoclot 
antibody, with binding specificity for the marker polypeptides. This antibody may be conjugal 
to a Label for subsequent detection of binding, samples are incubated for a time Sufficient 1 
formation of the immunocomplexes. Binding of the antibody is then detected by virtue of a Lab 
conjugated to this antibody. Where the antibody is unlabelled, a second labeled antibody may i 
employed, e.g., which is specific for the isotype of the anti-marker polypeptide antibod 
Examples of labels which may be employed include radionuclides, fluoresces 
chemiluminescence, and enzymes. 

Where enzymes are employed, the Substrate for the enzyme may be added to the samples i 
provide a colored or fluorescent product. Examples of suitable enzymes for use in conjugate 
include horseradish peroxidase, alkaline phosphatase, malate dehydrogenase and the like. Whei 
not commercially available, such antibody-enzyme conjugates are readily produced by technique 
known to those skilled in the art. 

In one embodiment, the assay is performed as a dot blot assay. The dot blot assay finds particula 
application where tissue samples are employed as it allows determination of the average amount o 
the marker polypeptide associated with a Single cell by correlating the amount of Jmarke 
polypeptide in a cell-free extract produced from a predetermined number of cells. 

In yet another embodiment, the invention contemplates using one or more antibodies which are 
generated against one or more of the marker polypeptides of this invention, which polypeptides are 
encoded by any of the polynucleotide sequences of the SEQ ID NO: 1 to 26 or 53 to 75. Such a 
panel of antibodies may be used as a reliable diagnostic probe for breast cancer. The assay of the 
present invention comprises contacting a biopsy sample containing cells, e.g., macrophages, with a 
panel of antibodies to one or more of the encoded products to determine the presence or absence of 
the marker polypeptides. 

The diagnostic methods of the subject invention may also be employed as follow-up to treatment, 
e.g., quantification of the level of marker polypeptides may be indicative of the effectiveness of 
current or previously employed therapies for malignant neoplasia and breast cancer in particular as 
well as the effect of these therapies upon patient prognosis. 

The diagnostic assays described above can be adapted to be used as prognostic assays, as well. 
Such an application takes advantage of the sensitivity of the assays of the Invention to events 
which take place at characteristic stages in the progression of plaque generation in case of 
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malignant neoplasia. For example, a given marker gene may be up- or down-regulated at a very 
early stage, perhaps before the cell is developing into a foam cell, while another marker gene may 
be characteristically up or down regulated only at a much later stage. Such a method could involve 
the steps of contacting the mRNA of a test cell with a polynucleotide probe derived from a given 
marker polynucleotide which is expressed at different characteristic levels in breast cancer tissue 
cells at different stages of malignant neoplasia progression, and determining the approximate 
amount of hybridization of the probe to the mRNA of the cell, such amount being an indication of 
the level of expression of the gene in the cell, and thus an indication of the stage of disease 
progression of the cell; alternatively, the assay can be carried out with an antibody specific for the 
gene product of the given marker polynucleotide, contacted with the proteins of the test cell. A 
battery of such tests will disclose not only the existence of a certain arteriosclerotic plaque, but 
also will allow the clinician to select the mode of treatment most appropriate for the disease, an^ 
to predict the likelihood of success of that treatment. 

The methods of the invention can also be used to follow the clinical course of a given breast 
cancer predisposition. For example, the assay of the Invention can be applied to a blood sample 
from a patient; following treatment of the patient for BREAST CANCER, another blood sample is 
taken and the test repeated. Successful treatment will result in removal of demonstrate differential 
expression, characteristic of the breast cancer tissue cells, perhaps approaching or even surpassing 
normal levels. 

Pnlype ptide activity 

In one embodiment the present invention provides a method for screening potentially therapeutic 
agents which modulate the activity of one or more "BREAST CANCER GENE" polypeptide| 
such that if the activity of the polypeptide is increased as a result of the upregulation ofW 
"BREAST CANCER GENE" in a subject having or at risk for malignant neoplasia and breast 
cLer in particular, the therapeutic substance will decrease the activity of the polypeptide relative 
to the activity of the some polypeptide in a subject not having or not at risk for malignant 
neoplasia or breast cancer in particular but not treated with the therapeutic agent. Likewise, if the 
activity of the polypeptide as a result of the downregulation of the "BREAST CANCER GENE" is 
decreased in a subject having or at risk for malignant neoplasia or breast cancer in particular, the 
therapeutic agent will increase the activity of the polypeptide relative to the activity of the same 
polypeptide in a subject not having or not at risk for malignant neoplasia or breast cancer in 
particular, but not treated with the therapeutic agent 
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The activity of the "BREAST CANCER GENE" polypeptides indicated in Table 2 or 3 may 
measured by any means known to those of skill in the art, and which are particular for the type 
activity performed by the particular polypeptide. Examples of specific assays which may be us, 
to measure the activity of particular polynucleotides are shown below. 

a) G protein coupled receptors 

In one embodiment, the "BREAST CANCER GENE" polynucleotide may encode a G prote: 
coupled receptor. In one embodiment, the present invention provides a method of screenin 
potential modulators (inhibitors or activators) of the G protein coupled receptor by measurin 
changes in the activity of the receptor in the presence of a candidate modulator. 



/■ Gi -coupled receptors 

Cells (such as CHO cells or primary cells) are stably transfected with the relevant receptor an, 
with an inducible CRE-luciferase construct. Cells are grown in 50% Dulbecco's modified Eagl, 
medium / 50% F12 (DMEMZF12) supplemented with 10% FBS, at 37°C in a humidifie< 
atmosphere with 10% CO, and are routinely split at a ratio of 1 : 1 0 every 2 or 3 days. Test culture: 
are seeded into 384 - well plates at an appropriate density (e.g. 2000 cells / well in 35 ul eel 
culture medium) in DMEM/F12 with FBS, and are grown for 48 hours (range: -24 - 60 hours 
depending on cell line). Growth medium is then exchanged against serum free medium (SFM; e.g 
Ultra-CHO), containing 0,1% BSA. Test compounds dissolved in DMSO are diluted in SFm' and 
transferred to the test cultures (maximal final concentration 10 umolar), followed by addition of 
forskolin (~ 1 umolar, final cone.) in SFM + 0,1% BSA 10 minutes later. In case of antagonist 
screening both, an appropriate concentration of agonist, and forskolin are added. The plates are 
incubated at 37°C in 10% CO, for 3 hours. Then the supernatant is removed, cells are lysed with 
lysis reagent (25 mmolar phosphate-buffer, pH 7,8, containing 2 mmolar DDT, 10% glycerol and 
3% Triton X100). The luciferase reaction is started by addition of substrate-buffer (e.g. luciferase 
assay reagent, Promega) and luminescence is immediately determined (e.g. Berthold luminometer 
or Hamamatzu camera system). 

2 . G. -coupled receptors 

Cells (such as CHO cells or primary cells) are stably transfected with the relevant receptor and 
with an inducible CRE-luciferase construct Cells are grown in 50% Dulbecco's modified Eagle 
medium / 50% F12 (DMEM/F12) supplemented with 10% FBS, at 37°C in a humidified 



BHC03 1 051 



98 



atmosphere with 10% C0 2 and are routinely split at a ratio of 1:10 every 2 or 3 days. Test cultures 
are seeded into 384 - well plates at an appropriate density (e.g. 1000 or 2000 cells / well in 35 ul 
cell culture medium) in DMEM/F12 with FBS, and are grown for 48 hours (range: -24 - 60 hours, 
depending on cell line). The assay is started by addition of test-compounds in serum free medium 
(SFM; e.g. Ultra-CHO) containing 0,1% BSA: Test compounds are dissolved in DMSO, diluted in 
SFM and transferred to the test cultures (maximal final concentration 10 umolar, DMSO cone. < 
0,6 %). In case of antagonist screening an appropriate concentration of agonist is added 5-10 
minutes later. The plates are incubated at 37°C in 10% CO, for 3 hours. Then the cells are lysed 
with 10 ul lysis reagent per well (25 mmolar phosphate-buffer, pH 7,8 , containing 2 mmolar 
DDT, 10% glycerol and 3% Triton XI 00) and the luciferase reaction is started by addition of 20 ul 
substrate-buffer per well (e.g. luciferase assay reagent, Promega). Measurement of luminescence is 
started immediately (e.g. Berthold luminometer or Hamamatzu camera system). 

3 . G q -coupled receptors 

Cells (such as CHO cells or primary cells) are stably transfected with the relevant receptor. Cells 
expressing Junctional receptor protein are grown in 50% Dulbecco's modified Eagle medium / 
50% F12 (DMEM/F12) supplemented with 10% FBS, at 37°C in a humidified atmosphere with 
5% CO2 and are routinely split at a cell line dependent ratio every 3 or 4 days. Test cultures are 
seeded into 384 - well plates at an appropriate density (e.g. 2000 cells / well in 35 ul cell culture 
medium) in DMEM/F12 with FBS, and are grown for 48 hours (range: -24 - 60 hours, depending 
on cell line). Growth medium is then exchanged against physiological salt solution (e.g. Tyrode 
solution). Test compounds dissolved in DMSO are diluted in Tyrode solution containing 0.1% 
BSA and transferred to the test cultures (maximal final concentration 10 umolar). After addition of 
the receptor specific agonist the resulting Gq-mediated intracellular calcium increase is measur^ 
using appropriate read-out systems (e.g. calcium-sensitive dyes). 

b) Ion channels 

Ion channels are integral membrane proteins involved in electrical signaling, transmembrane 
signal transduction, and electrolyte and solute transport By forming macromolecular pores 
through the membrane lipid bilayer, ion channels account for the flow of specific ion species 
driven by the electrochemical potential gradient for the permeating ion. At the single molecule 
level, individual channels undergo conformational transitions ("gating") between the 'open' (ion 
conducting) and 'closed' (non conducting) state. Typical single channel openings last for a few 
milliseconds and result in elementary transmembrane currents in the range of 10" 9 - 10 12 Ampere. 
Channel gating is controlled by various chemical and/or biophysical parameters, such as 
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neurotransmitters and intracellular second messengers ('ligand-gated' channels) or membrane 
potential (Voltage-gated* channels). Ion channels are functionally characterized by their ion 
selectivity, gating properties, and regulation by hormones and pharmacological agents. Because of 
their central role in signaling and transport processes, ion channels present ideal targets for 
pharmacological therapeutics in various pathophysiological settings. 

In one embodiment, the "BREAST CANCER GENE" may encode an ion channel. In one 
embodiment, the present invention provides a method of screening potential activators or 
inhibitors of channels activity of the "BREAST CANCER GENE" polypeptide. Screening for 
compounds interaction with ion channels to either inhibit or promote their activity can be based on 
(1.) binding and (2.) functional assays in living cells[ Hille (183)]. 

1. For ligand-gated channels, e.g. ionotropic neurotransmitter/hormone receptors, assays can 
be designed detecting binding to the target by competition between the compound and a 
labeled ligand. 

2. Ion channel function can be tested functionally in living cells. Target proteins are either 
expressed endogenously in appropriate reporter cells or are introduced recombinantly. 
Channel activity can be monitored by (2.1) concentration changes of the permeating ion 
(most prominently Ca 2+ ions), (2.2) by changes in the transmembrane electrical potential 
gradient, and (2.3) by measuring a cellular response (e.g. expression of a reporter gene, 
secretion of a neurotransmitter) triggered or modulated by the target activity. 

2.1 Channel activity results in transmembrane ion fluxes. Thus activation of ionic channels 
can be monitored by the resulting changes in intracellular ion concentrations using 
luminescent or fluorescent indicators. Because of its wide dynamic range and availability 
of suitable indicators this applies particularly to changes in intracellular Ca 2+ ion 
concentration ([Ca 2+ ]i). [Ca 2+ ]j can be measured, for example, by aequorin luminescence or 
fluorescence dye technology (e.g. using Fluo-3, Indo-1, Fura-2). Cellular assays can be 
designed where either the Ca 2+ flux through the target channel itself is measured directly 
or where modulation of the target channel affects membrane potential and thereby the 
activity of co-expressed voltage-gated Ca 2+ channels. 

2.2 Ion channel currents result in changes of electrical membrane potential (V^) which can be 
monitored directly using potentiometric fluorescent probes. These electrically charged 
indicators (e.g. the anionic oxonol dye DiBAC 4 (3)) redistribute between extra- and 
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intracellular compartment in response to voltage changes. The equilibrium distribution is 
governed by the Nernst-equation. Thus changes in membrane potential results in 
concomitant changes in cellular fluorescence. Again, changes in V m might be caused 
directly by the activity of the target ion channel or through amplification and/or 
prolongation of the signal by channels co-expressed in the same cell. 

2.3 Target channel activity can cause cellular Ca 2+ entry either directly or through activation 
of additional Ca 2+ channel (see 2.1). The resulting intracellular Ca 2+ signals regulate a 
. variety, of cellular responses, e.g. secretion or gene transcription. Therefore modulation of_ 
the target channel can be detected by monitoring secretion of a known 
hormone/transmitter from the target-expressing cell or through expression of a reporter 
gene (e.g. luciferase) controlled by an Ca 2+ -responsive promoter element (e.g. cyclic 
AMP/ Ca 2+ -responsive elements; CRE). ^) 

c) DNA-binding proteins and transcription factors 

In one embodiment, the "BREAST CANCER GENE" may encode a DNA-binding protein or a 
transcription factor. The activity of such a DNA-binding protein or a transcription factor may be 
measured, for example, by a promoter assay which measures the ability of the DNA-binding 
protein or the transcription factor to initiate transcription of a test sequence linked to a particular 
promoter. In one embodiment, the present invention provides a method of screening test 
compounds for its ability to modulate the activity of such a DNA-binding protein or a transcription 
factor by measuring the changes in the expression of a test gene which is regulated by a promoter 
which is responsive to the transcription factor. 

d) Promotor assays ^) 

A. promoter assay was set up with a human hepatocellular carcinoma cell HepG2 that was stably 
transfected with a luciferase gene under the control of a gene of interest (e.g. thyroid hormone) 
regulated promoter. The vector 2xIR01uc, which was used for transfection, carries a thyroid 
hormone responsive element (TRE) of two 12 bp inverted palindromes separated by an 8 bp spacer 
in front of a tk minimal promoter and the luciferase gene. Test cultures were seeded in 96 well 
plates in serum - free Eagle's Minimal Essential Medium supplemented with glutamine, tricine, 
sodium pyruvate, non - essential amino acids, insulin, selen, transferrin, and were cultivated in a 
humidified atmosphere at 10 % C0 2 at 37°C. After 48 hours of incubation serial dilutions of test 
compounds or reference compounds (L-T3, L-T4 e.g.) and co-stimulator if appropriate (final 
concentration 1 nM) were added to the cell cultures and incubation was continued for the optimal 
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time (e.g. another 4-72 hours). The cells were then lysed by addition of buffer containing Trit, 
XlOO and luciferin and the luminescence of luciferase induced by T3 or other compounds w 
measured in a luminometer. For each concentration of a test compound replicates of 4 were teste 
EC 50 - values for each test compound were calculated by use of the Graph Pad Prism Scientil 
software. 

Screening Methods 

The invention provides assays for screening test compounds which bind to or modulate the activil 
of a ..BREAST CANCER GENE" polypeptide or a .3REAST CANCER GENE" polynucleotid, 
A test compound preferably binds to a .3REAST CANCER GENE" polypeptide c 
polynucleotide. More preferably, a test compound decreases or increases „BREAST CANCE] 
GENE" activity by at least about 10, preferably about 50, more preferably about 75, 90, or 1009 
relative to the absence of the test compound. 

Test Compounds 

Test compounds can be pharmacological agents already known in the art or can be compound 
previously unknown to have any pharmacological activity. The compounds can be naturalh 
occurring or designed in the laboratory. They can be isolated from microorganisms, animals, o, 
plants, and can be produced recombinant, or synthesised by chemical methods known in the art. t 
desired, test compounds can be obtained using any of the numerous combinatorial library niethod* 
known in the art, including but not limited to, biological libraries, spatially addressable parallel 
solid phase or solution phase libraries, synthetic library methods requiring deconvolution, the one- 
bead one-compound library method, and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to polypeptide libraries, while the other four 
approaches are applicable to polypeptide, non-peptide oligomer, or small molecule libraries of 
compounds. [For review see Lam, 1997,(151)]. 

Methods for the synthesis of molecular libraries are well known in the art [see, for example, 
DeWitt et al., 1993, (152); Erb et al., 1994, (153); Zuckermann et al., 1994, (154); Cho et al.,' 
1993, (155); Carell et al., 1994, (156) and Gallop et al., 1994, (157). Libraries of compounds can 
be presented in solution [see, e.g., Houghten, 1992, (158)], or on beads [Lam, 1991, (159)], DNA- 
chips [Fodor, 1993, (160)], bacteria or spores (Ladner, U.S. Patent 5,223,409), plasmids [Cull et 
al., 1992, (161)], or phage [Scott & Smith, 1990, (162); Devlin, 1990, (163); Cwirla et al., 1990, 
(164); Felici, 1991,(165)]. 
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High Throughput Screening 

Test compounds can be screened for the ability to bind to „BREAST CANCER GENE" 
polypeptides or polynucleotides or to affect „BREAST CANCER GENE" activity or ,3REAST 
CANCER GENE" expression using high throughput screening. Using high throughput screening, 
many discrete compounds can be tested in parallel so that large numbers of test compounds can be 
quickly screened. The most widely established techniques utilize 96-well, 384-well or 1536-well 
microtiter plates. The wells of the microtiter plates typically require assay volumes that range from 
5 to 500 In addition to the plates, many instruments, materials, pipettors, robotics, plate 
washers, and plate readers are commercially available to fit the microwell formats. 

Alternatively, free format assays, or assays that have no physical barrier between samples, can be 
used. For example, an assay using pigment cells (melanocytes) in a simple homogeneous assay ^ 
combinatorial peptide libraries is described by Jayawickreme et al., (166). The cells are placed 
under agarose in culture dishes, then beads that carry combinatorial compounds are placed on the 
surface of the agarose. The combinatorial compounds are partially released the compounds from 
the beads. Active compounds can be visualised as dark pigment areas because, as the compounds 
diffuse locally into the gel matrix, the active compounds cause the cells to change colors. 

Another example of a free format assay is described by Chelsky, (167). Chelsky placed a simple 
homogenous enzyme assay for carbonic anhydrase inside an agarose gel such that the enzyme in 
the gel would cause a color change throughout the gel. Thereafter, beads carrying combinatorial 
compounds via a photolinker were placed inside the gel and the compounds were partially released 
by UV light. Compounds that inhibited the enzyme were observed as local zones of inhibition 
having less color change. ^ 

In another example, combinatorial libraries were screened for compounds that had cytotoxic 
effects on cancer cells growing in agar [Salmon et al., 1996, (168)]. 

Another high throughput screening method is described in Beutel et al., U.S. Patent 5,976,813. In 
this method, test samples are placed in a porous matrix. One or more assay components are then 
placed within, on top of, or at the bottom of a matrix such as a gel, a plastic sheet, a filter, or other 
form of easily manipulated solid support. When samples are introduced to the porous matrix they 
diffuse sufficiently slowly, such that the assays can be performed without the test samples running 
together. 
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Bindinfr Assays 

For binding assays, the test compound is preferably a small molecule which binds to and occupies 
for example, the ATP/GTP binding site of the enzyme or the active site of a .3REAST CANCE1 
GENE" polypeptide, such that normal biological activity is prevented. Examples of such smal 
molecules include, but are not limited to, small peptides or peptide-like molecules. 

In binding assays, either the test compound or a ,3REAST CANCER GENE" polypeptide cai 
comprise a detectable label, such as a fluorescent, radioisotopic, chemiluminescent, or enzymati. 
label, such as horseradish peroxidase, alkaline phosphatase, or luciferase. Detection of a tes 
compound which is bound to a „BREAST CANCER GENE" polypeptide can then be 
accomplished, for example, by direct counting of radioemmission, by scintillation counting, or bj 
determining conversion of an appropriate substrate to a detectable product. 

Alternatively, binding of a test compound to a ,3REAST CANCER GENE" polypeptide can be 
determined without labeling either of the interactants. For example, a microphysiometer can be 
used to detect binding of a test compound with a ,3REAST CANCER GENE" polypeptide. A 
microphysiometer (e.g., CytosensorJ) is an analytical instrument that measures the rate at.which a 
cell acidifies its environment using a light-addressable potentiometric sensor (LAPS). Changes in 
this acidification rate can be used as an indicator of the interaction between a test compound and a 
.3REAST CANCER GENE" polypeptide [McCoimell et al., 1 992, (1 69)]. 

4 

Determining the ability of a test compound to bind to a .3REAST CANCER GENE" polypeptide 
also can be accomplished using a technology such as real-time Bimolecular Interaction Analysis 
(BIA) [Sjolander & Urbaniczky, 1991, (170), and Szabo et al, 1995, (171)]. BIA is a technology 
for studying biospecific interactions in real time, without labeling any of the interactants (e.g., 
BIAcore™). Changes in the optical phenomenon surface plasmon resonance (SPR) can be used as 
an indication of real-time reactions between biological molecules. 

In yet another aspect of the invention, a ,3REAST CANCER GENE" polypeptide can be used as a 
"bait protein" in a two-hybrid assay or three-hybrid assay [see, e.g., U.S. Patent 5,283,317; Zervos 
et al, 1993, (172); Madura et al, 1993, (173); Bartel et al, 1993, (174); Iwabuchi et al, 1993, 
(175) and Brent WO 94/10300], to identify other proteins which bind to or interact with the 
.3REAST CANCER GENE" polypeptide and modulate its activity. 

The two-hybrid system is based on the modular nature of most transcription factors, which consist 
of separable DNA-binding and activation domains. Briefly, the assay utilizes two different DNA 
constructs. For example, in one construct, polynucleotide encoding a .3REAST CANCER GENE" 
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polypeptide can be fused to a polynucleotide encoding the DNA binding domain of a known 
transcription factor (e.g., GAL4). In the other construct a DNA sequence that encodes an 
unidentified protein ("prey" or "sample") can be fused to a polynucleotide that codes for the 
activation domain of the known transcription factor. If the "bait" and the "prey" proteins are able 
to interact in vivo to form an protein- dependent complex, the DNA-bmding and activation 
domains of the transcription factor are brought into close proximity. This proximity allows 
transcription of a reporter gene (e.g., LacZ), which is operably linked to a transcriptional 
regulatory site responsive to the transcription factor. Expression of the reporter gene can be 
detected, and cell colonies containing the functional transcription factor can be isolated and used 
to obtain the DNA sequence encoding the protein which interacts with the „BREAST CANCER 
GENE" polypeptide. 

It may be desirable to immobilize either a „BREAST CANCER GENE" polypeptide 
polynucleotide) or the test compound to facilitate separation of bound from unbound forms of one 
or. both of theinteractants, as well , as to accommodate automation of the assay. Thus, either a 
„BREAST CANCER GENE" polypeptide (or polynucleotide) or the test compound can be bound 
to a solid support. Suitable solid supports include, but are not limited to, glass or plastic slides, 
tissue culture plates, microtiter wells, tubes, silicon chips, or particles such as beads (including, 
but not limited to, latex, polystyrene, or glass beads). Any method known in the art can be used to 
attach a „BREAST CANCER GENE" polypeptide (or polynucleotide) or test compound to a solid 
support, including use of covalent and non-covalent linkages, passive absorption, or pairs of 
binding moieties attached respectively to the polypeptide (or polynucleotide) or test compound 
and the solid support. Test compounds are preferably bound to the solid support in an array, so that 
the location of individual test compounds can be tracked. Binding of a test compound to a 
„BREAST CANCER GENE" polypeptide (or polynucleotide) can be accomplished in any ve<^ 
suitable for containing the reactants. Examples of such vessels include microtiter plates, test tubes, 
and microcentrifuge tubes. 

In one embodiment, a ,3REAST CANCER GENE" polypeptide is a fusion protein comprising a 
domain that allows the ,3REAST CANCER GENE" polypeptide to be bound to a solid support. 
For example, glutathione S-transferase fusion proteins can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical, St. Louis, Mo.) or glutathione derivatized microtiter plates, which are 
then combined with the test compound or Ihe test compound and the nonadsorbed „BREAST 
CANCER GENE" polypeptide; the mixture is then incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for salt and pH). Following incubation, the 
beads or microtiter plate wells are washed to remove any unbound components. Binding of the 
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interactants can be determined either directly or indirectly, as described above. Alternatively , 
complexes can be dissociated from the solid support before binding is determined. 

Other techniques for immobilising proteins or polynucleotides on a solid support also can be us 
m the screening assays of the invention. For example, either a „BREAST CANCER GEN] 
polypeptide (or polynucleotide) or a test compound can be immobilized utilizing conjugation 
biotm and streptavidin. Biotinylated ..BREAST CANCER GENE" polypeptides ( 
polynucleotides) or test compounds can be prepared from biotin NHS (N-hydroxysuccinimid 
usmg techniques well known in the art (e.g.. biotinylation kit. Pierce Chemicals, Rockford 111 
and .mmobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical 
Alternately, antibodies which specifically bind to a ..BREAST CANCER GENE" polypeptid, 
polynucleotide, or a test compound, but which do not interfere with a desired binding site, such a 
the ATP/GTP binding site or the active site of the ..BREAST CANCER GENE" polypeptide ca 
be derivatised to the wells of the plate. Unbound target or protein can be trapped in the wells b 
antibody conjugation. 

Methods for detecting such complexes, in addition to those described above for the GST 
immobilized complexes, include immunodetection of complexes using antibodies • whicl 
specifically bind to a , 3 REAST CANCER GENE" polypeptide or test compound, enzyme-linkec 
assays wluch rely on detecting an activity of a , 3 REAST CANCER GENE" polypeptide, and SDS 
gel electrophoresis under non-reducing conditions. 

Screening for test compounds which bind to a ..BREAST CANCER GENE" polypeptide ox 
polynucleotide also can be carried out in an intact cell. Any cell which comprises a ..BREAST 
CANCER GENE" polypeptide or polynucleotide can be used in a cell-based assay system A 
..BREAST CANCER GENE" polynucleotide can be naturally occurring in the cell or can' be 
mtroduced using techniques such as those described above. Binding of the test compound to a 
..BREAST CANCER GENE" polypeptide or polynucleotide is determined as described above. 
Modulation of Gene Expression 

In another embodiment, test compounds which increase or decrease ..BREAST CANCER GENE" 
expression are identified. A ..BREAST CANCER GENE" polynucleotide is contacted with a test 
compound, and the expression of an RNA or polypeptide product of the , 3 REAST CANCFR 
GENE" polynucleotide is determined. The level of expression of appropriate mRNA or poly- 
peptide m the presence of the test compound is compared to the level of expression of mRNA or 
polypeptide in the absence of the test compound. The test compound can then be identified as a 
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modulator of expression based on this comparison. For example, when expression of mRNA or 
polypeptide is greater in the presence of the test compound than in its absence, the test compound 
is identified as a stimulator or enhancer of the mRNA or polypeptide expression. Alternatively, 
when expression of the mRNA or polypeptide is less in the presence of the test compound than in 
its absence, the test compound is identified as an inhibitor of the mRNA or polypeptide 
expression. 

The level of 3REAST CANCER GENE" mRNA or polypeptide expression in the cells can be 
determined by methods well known in the art for detecting. mRNA. or polypeptide. Either 
qualitative or quantitative methods can be used. The presence of polypeptide products of a 
„BREAST CANCER GENE" polynucleotide can be determined, for example, using a variety of 
techniques known in the art, including immunochemical methods such as radioimmunoassay, 
Western blotting, and immunohistochemistry. Alternatively, polypeptide synthesis can 1# 
determined in vivo, in a cell culture, or in an in vitro translation system by detecting incorporation 
of labeled amino acids into a , BREAST CANCER GENE" polypeptide. 

Such screening can be carried out either in a cell-free assay system or in an intact cell. Any cell 
which expresses a „BREAST CANCER GENE" polynucleotide can be used in a cell-based assay 
system. A „BREAST CANCER GENE" polynucleotide can be naturally occurring in the cell or 
can be introduced using techniques such as those described above. Either a primary culture or an 
established cell line, such as CHO or human embryonic kidney 293 cells, can be used. 

Therapeutic Indications and Methods 

Therapies for treatment of breast cancer primarily relied upon effective chemotherapeutic drugs 
for intervention on the cell proliferation, cell growth or angiogenesis. The advent of genomi^ 
driven molecular target identification has opened up the possibility of identifying new breast 
cancer-specific targets for therapeutic intervention that will provide safer, more effective 
treatments for malignant neoplasia patients and breast cancer patients in particular. Thus, newly 
discovered breast cancer-associated genes and their products can be used as tools to develop 
innovative therapies. The identification of the Her2/neu receptor kinase presents exciting new 
opportunities for treatment of a certain subset of tumor patients as described before. Genes playing 
important roles in any of the physiological processes outlined above can be characterized as breast 
cancer targets. Genes or gene fragments identified through genomics can readily be expressed in 
one or more heterologous expression systems to produce functional recombinant proteins. These 
proteins are characterized in vitro for their biochemical properties and then used as tools in high- 
throughput molecular screening programs to identify chemical modulators of their biochemical 
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activities. Modulators of target gene expression or protein activity can be identified in this man™ 
and subsequently tested in cellular and in vivo disease models for therapeutic activit 
Optimization of lead compounds with iterative testing in biological models and detaile 
pharmacokinetic and toxicological analyses form the basis for drug development and subsequei 
testing in humans. 

This invention further pertains to the use of novel agents identified by the screening assay 
described above. Accordingly, it is within the scope of this invention to use a test compoun 
identified as described herein in an appropriate animal model. For example, an agent identified a 
described herein (e.g., a modulating agent, an antisense polynucleotide molecule, a specifi. 
antibody, ribozyme, or a human „BREAST CANCER GENE" polypeptide binding molecule) cai 
be used in an animal model to determine the efficacy, toxicity, or side effects of treatment wit! 
such an agent. Alternatively, an agent identified as described herein can be used in an anima 
model to determine the mechanism of action of such an agent. Furthermore, this invention pertain: 
to uses of novel agents identified by the above described screening assays for treatments a: 
described herein. 

A reagent which affects human „BREAST CANCER GENE" activity can be administered to i 
human cell, either in vitro or in vivo, to reduce or increase human „BREAST CANCER GENE' 
activity. The reagent preferably binds to an expression product of a human .3REAST CANCER 
GENE". If the expression product is a protein, the reagent is preferably an antibody. For treatmenl 
of human cells ex vivo, an antibody can be added to a preparation of stem cells which have been 
removed from the body. The cells can then be replaced in the same or another human body, with 
or without clonal propagation, as is known in the art. 

In one embodiment, the reagent is delivered using a liposome. Preferably, the liposome is stable in 
the animal into which it has been administered for at least about 30 minutes, more preferably for at 
least about 1 hour, and even more preferably for at least about 24 hours. A liposome comprises a 
lipid composition that is capable of targeting a reagent, particularly a polynucleotide, to a 
particular site in an animal, such as a human. Preferably, the lipid composition of the liposome is 
capable of targeting to a specific organ of an animal, such as the lung, liver, spleen, heart brain, 
lymph nodes, and skin. 

A liposome useful in the present invention comprises a lipid composition that is capable of fusing 
with the plasma membrane of the targeted cell to deliver its contents to the cell. Preferably, the 
transfection efficiency of a liposome is about 0.5 ug of DNA per 16 nmol of liposome delivered to 
about 10 6 cells, more preferably about 1.0 ug of DNA per 16 nmol of liposome delivered to about 
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10 6 cells, and even more preferably about 2.0 pg of DNA per 16 nmol of liposome delivered to 
about 10 6 cells. Preferably, a liposome is between about 100 and 500 nm, more preferably between 
about 150 and 450 nm, and even more preferably between about 200 and 400 nm in diameter. 

Suitable liposomes for use in the present invention include those liposomes usually used in, for 
example, gene delivery methods known to those of skill in the art. More preferred liposomes 
include liposomes having a polycationic lipid composition and/or liposomes having a cholesterol 
backbone conjugated to polyethylene glycol. Optionally, a liposome comprises a compound 
capable of targeting the liposome to a particular cell type, such as a cell-specific ligand exposed on 
the outer surface of the liposome. 

Complexing a liposome with a reagent such as an antisense oligonucleotide or ribozyme can be 
achieved using methods which are standard in the art (see, for example, U.S. Patent 5,705,15D^ 
Preferably, from about 0.1 pg to about 10 pg of polynucleotide is combined with about 8 nmol ot 
liposomes, more preferably from about 0.5 pg to about 5 fig of polynucleotides are combined with 
about 8 nmol liposomes, and even more preferably about 1.6 fig of polynucleotides is combined 
with about 8 nmol liposomes. 

In another embodiment, antibodies can be delivered to specific tissues in vivo using receptor- 
mediated targeted delivery. Receptor-mediated DNA delivery techniques are taught in, for 
example, Findeis et al., 1993, (176); Chiou et al., 1994, (177); Wu & Wu, 1988, (178); Wu et al., 
1994, (179);Zenkeetal., 1990, (180); Wuetal., 1991,(181). 

Determination of a Therapeutical ly Effective Dose 

The determination of a therapeutically effective dose is well within the capability of those skilted 
in the art. A therapeutically effective dose refers to that amount of active ingredient which 
increases or decreases human J3REAST CANCER GENE" activity relative to the human 
„BREAST CANCER GENE" activity which occurs in the absence of the therapeutically effective 
dose. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays or in animal models, usually mice, rabbits, dogs, or pigs. The animal model also can 
be used to determine the appropriate concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in humans. 

Therapeutic efficacy and toxicity, e.g., ED 50 (the dose therapeutically effective in 50% of the 
population) and LD 50 (the dose lethal to 50% of the population), can be determined by standard 
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pharmaceutical procedures in cell cultures or experimental animals. The dose ratio of toxic to 
therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LD 5 o/ED so . 

Pharmaceutical compositions which exhibit large therapeutic indices are preferred. The dats 
obtained from cell culture assays and animal studies is used in formulating a range of dosage fo: 
human use. The dosage contained in such compositions is preferably within a range of circulatinj 
concentrations that include the ED 50 with little or no toxicity. The dosage varies within this rang< 
depending upon the dosage form employed, sensitivity of the patient, and the route o: 
administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the subjecl 
that requires treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active ingredient or to maintain the desired effect. Factors which can be taken into account include 
the severity of the disease state, general health of the subject, age, weight, and gender of the 
subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions can be administered 
every 3 to 4 days, every week, or once every two weeks depending on the half-life and clearance 
rate of the particular formulation. 

Normal dosage amounts can vary from 0.1 to 100,000 micrograms, up to a total dose of about 1 g, 
depending upon the route of administration. Guidance as to particular dosages and methods of 
delivery is provided in the literature and generally available to practitioners in the art.' Those 
skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

If the reagent is a single-chain antibody, polynucleotides encoding the antibody can be constructed 
and introduced into a cell either ex vivo or in vivo using well-established techniques including, but 
not limited to, transferrin-polycation-mediated DNA transfer, transfection with naked or 
encapsulated nucleic acids, liposome-mediated cellular fusion, intracellular transportation of 
DNA-coated latex beads, protoplast fusion, viral infection, electroporation, a gene gun, and 
DEAE- or calcium phosphate-mediated transfection. 

Effective in vivo dosages of an antibody are in the range of about 5 ug to about 50 ug/kg, about 
50 ug to about 5 mg/kg, about 100 ug to about 500 ug/kg of patient body weight, and about 200 to 
about 250 ug/kg of patient body weight. For administration of polynucleotides encoding single- 
chain antibodies, effective in vivo dosages are in the range of about 100 ng to about 200 ng, 
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500 ng to about 50 mg, about 1 ng to about 2 mg, about 5 \ig to about 500 p.g, and about 20 ng to 
about 100 fig ofDNA. 

If the expression product is mRNA, the reagent is preferably an antisense oligonucleotide or a 
ribozyme. Polynucleotides which express antisense oligonucleotides or ribozymes can be 
introduced into cells by a variety of methods, as described above. 

Preferably, a reagent reduces expression of a „BREAST CANCER GENE" gene or the activity of 
a "BREAST CANCER GENE" polypeptide by at least about 10, preferably about 50, more 
preferably about 75, 90, or 100% relative to the absence of the reagent. The effectiveness of the 
mechanism chosen to decrease the level of expression of a „BRE AST CANCER GENE" gene or 
the activity of a „BREAST CANCER GENE" polypeptide can be assessed using methods well 
known in the art, such as hybridization of nucleotide probes to ,3REAST CANCER GEN^^ 
specific mRNA, quantitative RT-PCR, immunologic detection of a „BREAST CANCER GENE" 
polypeptide, or measurement of „BREAST CANCER GENE" activity. 

In any of the embodiments described above, any of the pharmaceutical compositions of the 
invention can be administered in combination with other appropriate therapeutic agents. Selection 
of the appropriate agents for use in combination therapy can be made by one of ordinary skill in 
the art, according to conventional pharmaceutical principles. The combination of therapeutic 
agents can act synergistically to effect the treatment or prevention of the various disorders 
described above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

Any of the therapeutic methods described above can be applied to any subject in need of such 
therapy, including, for example, birds and mammals such as dogs, cats, cows, pigs, sheep, go^fc 
horses, rabbits, monkeys, and most preferably, humans. 

All patents and patent applications cited in this disclosure are expressly incorporated herein by 
reference. The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples which are provided 
for purposes of illustration only and are not intended to limit the scope of the invention. 

Pharmaceutical Compositions 

The invention also provides pharmaceutical compositions which can be administered to a patient 
to achieve a therapeutic effect. Pharmaceutical compositions of the invention can comprise, "for 
example, a ,JBREAST CANCER GENE" polypeptide, ,3REAST CANCER GENE" polynucleo- 
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tide, ribozymes or antisense oligonucleotides, antibodies which specifically bind to a „BREAS 
CANCER GENE" polypeptide, or mimetics, agonists, antagonists, or inhibitors of a „BREAS 
CANCER GENE" polypeptide activity. The compositions can be administered alone or i 
combination with at least one other agent, such as stabilizing compound, which can t 
administered in any sterile, biocompatible pharmaceutical carrier, including, but not limited t< 
saline, buffered saline, dextrose, and water. The compositions can be administered to a patier 
alone, or in combination with other agents, drugs or hormones. 

In addition to the active ingredients, these pharmaceutical compositions can contain suitabl 
pharmaceutical^ acceptable carriers comprising excipients and auxiliaries which facilitat 
processing of the active compounds into preparations which can be used pharmaceutically 
Pharmaceutical compositions of the invention can be administered by any number of route 
including, but not limited to, oral, intravenous, intramuscular, intraarterial, intramedullary 
intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, parenteral 
topical, sublingual, or rectal means. Pharmaceutical compositions for oral administration can b« 
formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable foi 
oral administration. Such carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, f 0l 
ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of 'active 
compounds with solid excipient, optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores, 
suitable excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 
methyl cellulose, hydroxypropylmethylcellulose, or sodium carboxymethylcellulose; gums 
including arabic and tragacanth; and proteins such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents can be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores can be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which also can contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments can be added to the tablets or dragee coatings for product 
identification or to characterize the quantity of active compound, i.e., dosage. 
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Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as 
well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. Push-fit 
capsules can contain active ingredients mixed with a filler or binders, such as lactose or starches, 
lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, the 
active compounds can be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration can be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions can contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, 
or dextran. Additionally, suspensions of the active compounds can be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame <# 
or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Non-lipid 
polycationic amino polymers also can be used for delivery. Optionally, the suspension also can 
contain suitable stabilizers or agents which increase the solubility of the compounds to allow for 
the preparation of highly concentrated solutions. For topical or nasal administration, penetrants 
appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants 
are generally known in the art. 

The pharmaceutical compositions of the present invention can be manufactured in a manner that is 
known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee making, 
levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. The pharmaceutical 
composition can be provided as a salt and can be formed with many acids, including but not 
limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to be n^ 
soluble in aqueous or other protonic solvents than are the corresponding free base forms. In other 
cases, the preferred preparation can be a lyophiiized powder which can contain any or all of the 
following: 150 mM histidine, 0.1%2% sucrose, and 27% mannitol, at a pH range of 4.5 to 5.5, that 
is combined with buffer prior to use. 

Further details on techniques for formulation and administration can be found in the latest edition 
of REMINGTON'S PHARMACEUTICAL SCIENCES (182). After pharmaceutical compositions have 
been prepared, they can be placed in an appropriate container and labeled for treatment of an 
indicated condition. Such labeling would include amount, frequency, and method of 
administration. 
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MA TERIAL AND METHODS 

One strategy for identifying genes that are involved in breast cancer is to detect genes that ar 
expressed differentially under conditions associated with the disease versus non-diseas 
conditions. The sub-sections below describe a number of experimental systems which may be use. 
to detect such differentially expressed genes. In general, these experimental systems include a 
least one experimental condition in which subjects or samples are treated in a manner associate! 
with breast cancer, in addition to at least one experimental control condition lacking such diseasi 
associated treatment. Differentially expressed genes are detected, as described below, b] 
comparing the pattern of gene expression between the experimental and control conditions. 

Once a particular gene has been identified through the use of one such experiment, its expressior 
pattern may be further characterized by studying its expression in a different experiment and the 
findings may be validated by an independent technique. Such use of multiple experiments may be 
useful in distinguishing the roles and relative importance of particular genes in breast cancer. A 
combined approach, comparing gene expression pattern in cells derived from breast cancel 
patients to those of in vitro cell culture models can give substantial hints on the pathways involved 
in development and/or progression of breast cancer. 

Among the experiments which may be utilized for the identification of differentially expressed 
genes involved in malignant neoplasia and breast cancer, for example, are experiments designed to 
analyze those genes which are involved in signal transduction. Such experiments may serve to 
identify genes involved in the proliferation of cells. 

Below are methods described for the identification of genes which are involved in breast cancer. 
Such represent genes which are differentially expressed in breast cancer conditions relative to their 
expression in normal, or non-breast cancer conditions or upon experimental manipulation based on 
clinical observations. Such differentially expressed genes represent "target" and/or "marker" 
genes. Methods for the further characterization of such differentially expressed genes, and for their 
identification as target and/or marker genes, are presented below. 

Alternatively, a differentially expressed gene may have its expression modulated, i.e., 
quantitatively increased or decreased, in normal versus breast cancer states, or under control 
versus experimental conditions. The degree to which expression differs in normal versus breast 
cancer or control versus experimental states need only be large enough to be visualized via 
standard characterization techniques, such as, for example, the differential display technique 
described below. Other such standard characterization techniques by which expression differences 
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may be visualized include but are not limited to quantitative RT-PCR and Northern analyses, 
which are well known to those of skill in the art. 

As part of this invention, a method is described by way of illustration and not by limitation, 
displaying at least some of the below mentioned aspects: 

1. A method for the prediction, diagnosis or prognosis of malignant neoplasia by the detection 
of at least 2 markers characterized in that the markers are genes and fragments thereof or 
genomic nucleic acid sequences that are located on one chromosomal region which is 
altered in malignant neoplasia. 

2. A method for the prediction, diagnosis or prognosis of malignant neoplasia by the detection 
of at least 2 markers characterized in that the markers are: 

a) genes that are located on one or more chromosomal region(s) which is/are altered 
in malignant neoplasia; and 

b) 

i) receptor and ligand; or 

ii) members of the same signal transduction pathway; or 

iii) members of synergistic signal transduction pathways; or 

iv) members of antagonistic signal transduction pathways; or 

v) transcription factor and transcription factor binding site. 

3. The method of aspect 1 or 2 wherein the malignant neoplasia is breast cancer, ovarian 
cancer, gastric cancer, colon cancer, esophageal cancer, mesenchymal cancer, bladder 
cancer or non-small cell lung cancer. 

4. The method of aspect 1 or 2 wherein at least one chromosomal region is defined as the 
cytogenetic region: lpl3, lq32, 3p21-p24, 5pl3-pl4, 8q23-q24, llql3, 12ql3,17ql2-q24 
or 20ql3. 

5. The method of aspect 1 or 2 wherein at least chromosomal region is defined as the 
cytogenetic region 17ql 1.2-21. 3 and the malignant neoplasia is breast cancer, ovarian 
cancer, gastric cancer, colon cancer, esophageal cancer, mesenchymal cancer, bladder 
cancer or non-small cell lung cancer. 
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The method of aspect 1 or 2 wherein at least one chromosomal region is defined as t 
cytogenetic region 3p21-24 and the malignant neoplasia is breast cancer, ovarian canc< 
gastric cancer, colon cancer, esophageal cancer, mesenchymal cancer, bladder cancer 
non-small cell lung cancer. 

The method of aspect 1 or 2 wherein at least one chromosomal region is defined as u* 
cytogenetic region 12ql3 and the malignant neoplasia is breast cancer, ovarian cance 
gastric cancer, colon cancer, esophageal cancer, mesenchymal cancer, bladder cancer ( 
non-small cell lung cancer. 

A method for the prediction, diagnosis or prognosis of malignant neoplasia by th 
detection of at least one marker whereby the marker is a VNTR, SNP, RFLP or ST, 
characterized in that the marker is located on one chromosomal region which is altered i 
malignant neoplasia due to amplification and the marker is detected in a cancerous and ; 
non-cancerous tissue or biological sample of the same individual. 

The method of aspect 8 wherein the marker is selected from the group consisting of th. 
VNTRs: 

D17S946, D17S1181, D17S2026, D17S838, D17S250, D17S1818, D17S614, D17S2019 
D17S608, D17S1655, D17S2147, D17S754, D17S1814, D17S2007, D17$1246 : 
D17S1979, D17S1984, D17S1984, D17S1867, D17S1788, D17S1836, D17S1787 
D17S1660, D17S2154, D17S1955, D17S2098, D17S518, D17S1851, D11S4358! 
D17S964, D19SI091, D17S1179, D10S2160, D17S1230, D17S1338, D17S201l[ 
D17S1237, D17S2038, D17S2091, D17S649, DHSl 190 and M87506. 

The method of aspect 8 wherein the marker is selected from the group consisting of the 
SNPs: 

rs2230698, rs2230700, rsl058808, rsl801200, rs903506, rs2313170, rsl 136201, 
rs2934968, rs2172826, rsl810132, rsl80120l, rs2230702, rs2230701, rsll26503, rs3471,' 
rsl3695, rs471692, rs558068, rsl064288, rsl061692, rs520630, rs782774, ^565121,' 
rs2586112, rs532299, rs2732786, rsl804539, rsl804538, rsl804537, rsll41364, rsl2231 
rsll32259, rsll32257, rsll32256, rsll32255, rsll32254, rsll32252, rsll32268 and 
rsl 132258 

A method for the prediction, diagnosis or prognosis of malignant neoplasia by the 
detection of at least one marker characterized in that the marker is selected from: 
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a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 26, 53 to 75, or 315 to 318 ; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) and encodes a polypeptide exhibiting 

5 the same biological function as specified for the respective sequence in Table 2 or 3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (c) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

10 d) a polynucleotide or polynucleotide analog which represents a specific fragmem, 

derivative or allelic variation of a polynucleotide sequence specified in (a) to (d) W 

. . e ) a purified polypeptide encoded , by a polynucleotide or polynucleotide analog 
sequence specified in (a) to (e) 

f) A purified polypeptide comprising at least one of the sequences of SEQ ID NO: 28 
15 to 32, 34, 35, 37 to 42, 44, 45, 47 to 52, 76 to 98, or 393 to 396; 

are detected. 

12. A method for the prediction, diagnosis or prognosis of malignant neoplasia by the 
detection of at least 2 markers characterized in that at least 2 markers are selected from: 

a) polynucleotide or polynucleotide analog comprising at least one of the sequencj| 
20 of SEQ ID NO: 1 to 26 or 53 to 75 or 315 to 318; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) and encodes a polypeptide 
exhibiting the same biological function as specified for the respective sequence in 
Table 2 or 3 

25 c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 

polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 
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d) a polynucleotide or polynucleotide analog which represents a specific fragme 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 

e) a purified polypeptide encoded by a polynucleotide sequence or polynucleoti. 
analog specified in (a) to (d) 

5 f) a purified polypeptide comprising at least one of the sequences of SEQ ID NO: : 

to 52 or 76 to 98 or 393 to 396 

are detected. 

13. The method of any of the aspects 1 or 12 wherein the detection method comprises the w 
of PCR, arrays or beads. 

10 14. A diagnostic kit comprising instructions for conducting the method of any of aspects 1 t 
13. 

15. A composition for the prediction, diagnosis or prognosis of malignant neoplasi 
comprising: 



15 



20 



25 



a) a detection agent for: 

i) any polynucleotide or polynucleotide analog comprising at least one of th< 
sequences of SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 53 to 75 
or 315 to 318, 

ii) any polynucleotide or polynucleotide analog which hybridizes undei 
stringent conditions to a polynucleotide specified in (a) encoding a 
polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

iii) a polynucleotide or polynucleotide analog the sequence of which deviates 
from the polynucleotide specified in (a) and (b) due to the generation of 
the genetic code encoding a polypeptide exhibiting the same biological 
function as specified for the respective sequence in Table 2 or 3 

iv) a polynucleotide or polynucleotide analog which represents a specific 
fragment, derivative or allelic variation of a polynucleotide sequence 
specified in (a) to (c) 
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v) a polypeptide encoded by a polynucleotide or polynucleotide analog 
sequence specified in (a) to (d); 

vi) a polypeptide comprising at least one of the sequences of SEQ ID NO: 28 
to 32, 34, 35, 37 to 42, 44, 45, 47 to 52, 76 to 98, or 393 to 396. 



or 



b) at least 2 detection agents for at least 2 markers selected from: 

i) any polynucleotide comprising at least one of the sequences of SEQ ID 
NO: 1 to 26 or 53 to 75 or 315 to 318; 

ii) any polynucleotide which hybridizes under stringent conditions to a 
10 polynucleotide specified in (a) encoding a polypeptide exhibiting the samW 

biological function as specified for the respective sequence in Table 2 or 3 

iii) a polynucleotide the sequence of which deviates from the polynucleotide 
specified in (a) and (b) due to the generation of the genetic code encoding 
a polypeptide exhibiting the same biological function as specified for the 

15 respective sequence in Table 2 or 3 

iv) a polynucleotide which represents a specific fragment, derivative or allelic 
variation of a polynucleotide sequence specified in (a) to (c) 

v) a polypeptide encoded by a polynucleotide sequence specified in (a) to (d); 

vi) a polypeptide comprising at least one of the sequences of SEQ ID NO: ^ 
20 to 52 or 76 to 98 or 393 to 396. 

16. An array comprising a plurality of polynucleotides or polynucleotide analogs wherein each 
of the polynucleotides is selected from: 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 1 to 26 or 53 to 75 or 315 to 318; 

25 b) a polynucleotide or polynucleotide analog which hybridizes under stringent 

conditions to a polynucleotide specified in (a) encoding a polypeptide exhibiting 
the same biological function as specified for the respective sequence in Table 2 or 
3 
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c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

5 d) a polynucleotide or polynucleotide analog which represents a specific fragment, 

derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 

attached to a solid support. 

17. A method of screening for agents which regulate the activity of a polypeptide encoded by 
a polynucleotide or polynucleotide analog selected from the group consisting of: 

10 a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 

of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 26, 53 to 75 or 315 to 318; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) encoding a polypeptide exhibiting 
the same biological function as specified for the respective sequence in Table 2 or 

15 3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

20 d) a polynucleotide or polynucleotide analog which represents a specific fragment, 

derivative or allelic variation of a polynucleotide sequence specified in (a) to (c); 

comprising the steps of: 

i) contacting a test compound with at least one polypeptide encoded by a 
polynucleotide specified in (a) to (d); and 

25 ii) detecting binding of the test compound to the polypeptide, wherein a test 

compound which binds to the polypeptide is identified as a potential therapeutic 
agent for modulating the activity of the polypeptide in order to prevent of treat 
malignant neoplasia. 
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1 8. A method of screening for agents which regulate the activity of a polypeptide encoded by 
a polynucleotide or polynucleotide analog selected from the group consisting of: 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 53 to 75, or 315 to 318; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) encoding a polypeptide exhibiting 
the same biological function as specified for the respective sequence in Table 2 or 
3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for tnr 
respective sequence in Table 2 or 3 



d) a polynucleotide or polynucleotide analog which represents a specific fragment, 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 

comprising the steps of: 

i) contacting a test compound with at least one polypeptide encoded by a 
polynucleotide specified in (a) to (d); and 

ii) detecting the activity of the polypeptide as specified for the respective sequence in 
Table 2 or 3, wherein a test compound which increases the activity is identified as 
a potential preventive or therapeutic agent for increasing the polypeptide aciti^ 
in malignant neoplasia, and wherein a test compound which decreases the activity 
— tr~"j f-f"— "o lUMiuiivu aa a jjuibuuai uicia{/cuuv agent lor Decreasing the 
polypeptide activity in malignant neoplasia. 

19. A method of screening for agents which regulate the activity of a polynucleotide or 
polynucleotide analog selected from group consisting of; 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 26, 53 to 75 or 315 to 318; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) encoding a polypeptide exhibiting 
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the same biological function as specified for the respective sequence in Table 2 c 
3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from th 
polynucleotide specified in (a) and (b) due to the generation of the genetic cod 
encoding a polypeptide exhibiting the same biological function as specified for th. 
respective sequence in Table 2 or 3 

d) a polynucleotide or polynucleotide analog which represents a specific fragment 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 

comprising the steps of: 

i) contacting a test compound with at least one polynucleotide or polynucleotide 
analog specified in (a) to (d), and 

ii) detecting binding of the test compound to the polynucleotide, wherein a tesl 
compound which binds to the polynucleotide is identified as a potential preventive 
or therapeutic agent for regulating the activity of the polynucleotide in malignant 
neoplasia. 

Use of ! 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 53 to 75 or 3 15 to 3 18; 

b) a polynucleotide which hybridizes under stringent conditions to a polynucleotide 
or polynucleotide analog specified in (a) encoding a polypeptide exhibiting the 
same biological function as specified for the respective sequence in Table 2 or 3; 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3; 

d) a polynucleotide or polynucleotide analog which represents a specific fragment, 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c); 

e) an antisense molecule targeting specifically one of the polynucleotide sequences 
specified in (a) to (d); 
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f) a purified polypeptide encoded by a polynucleotide or polynucleotide analog 
sequence specified in (a) to (d) 

g) a purified polypeptide comprising at least one of the sequences of SEQ ID NO: 28 
to 32, 34, 35, 37 to 42, 44, 45, 47 to 52, 76 to 98 or 393 to 396; 

5 h) an antibody capable of binding to one of the polynucleotide specified in (a) to (d) 

or a polypeptide specified in (f) and (g); 

i) a reagent identified by any of the methods of aspect 17 to 19 that modulates the 
amount or activity of a polynucleotide sequence specified in (a) to (d) or a 
polypeptide specified in (f) and (g); 

10 in the preparation of a composition for the prevention, prediction, diagnosis, prognosis 

medicament for the treatment of malignant neoplasia. 



2 1 . Use of aspect 20 wherein the disease is breast cancer. 

22. A reagent that regulates the activity of a polypeptide selected from the group consisting of: 

a) a polypeptide encoded by any polynucleotide or polynucleotide analog comprising 
15 at least one of the sequences of SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 

53 to 75 or 315 to 318; 

b) a polypeptide encoded by any polynucleotide or polynucleotide analog which 
hybridizes under stringent conditions to any polynucleotide comprising at least 
one of the sequences of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 26, 5^j| 

20 75 or 3 1 5 to 3 1 8 encoding a polypeptide exhibiting the same biological function as 

specified for the respective sequence in Table 2 or 3 

c) a polypeptide encoded by any polynucleotide or polynucleotide analog the 
sequence of which deviates from the polynucleotide specified in (a) and (b) due to 
the generation of the genetic code encoding a polypeptide exhibiting the same 

25 biological function as specified for the respective sequence in Table 2 or 3 

d) a polypeptide encoded by any polynucleotide or polynucleotide analog which 
represents a specific fragment, derivative or allelic variation of a polynucleotide 
sequence specified in (a) to (c)_encoding a polypeptide exhibiting the same 
biological function as specified for the respective sequence in Table 2 or 3 
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e) or a polypeptide comprising at least one of the sequences of SEQ ID NO: 28 to 3 
34, 35, 37 to 42, 44, 45, 47 to 52, 76 to 98 or 393 to 396; 

wherein said reagent is identified by the method of any of the aspects 17 to 19. 

A reagent that regulates the activity of a polynucleotide or polynucleotide analog selecte 
from the group consisting of: 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequence 
SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 53 to 75 or 315 to 318; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringer 
conditions to a polynucleotide specified in (a) encoding a polypeptide exhibitini 
the same biological function as specified for the respective sequence in Table 2 o 
3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from th< 
polynucleotide specified in (a) and (b) due to the generation of the genetic cod< 
encoding a polypeptide exhibiting the same biological function as specified for ttu 
respective sequence in Table 2 or 3 

d) a polynucleotide or polynucleotide analog which represents a specific fragment, 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 
encoding a polypeptideexhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

wherein said reagent is identified by the method of any of the aspects 17 to 19. 
24. A pharmaceutical composition, comprising: 

a) an expression vector containing at least one polynucleotide or polynucleotide 
analog selected from the group consisting of: 

i) a polynucleotide or polynucleotide analog comprising at least one of the 
sequences of SEQ ID NO: 2 to 6, 8, 9, 1 1 to 16, 18, 19, 21 to 26, 53 to 75 
or315to318; 



ii) a polynucleotide or polynucleotide analog which hybridizes under 
stringent conditions to a polynucleotide specified in (a) encoding a 



BHC03 1 051 



- 124- 

polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 

iii) a polynucleotide or polynucleotide analog the sequence of which deviates 
from the polynucleotide specified in (a) and (b) due to the generation of 

5 the genetic codejsncoding a polypeptide exhibiting the same biological 

function as specified for the respective sequence in Table 2 or 3 

iv) a polynucleotide or polynucleotide analog which represents a specific 
fragment, derivative or allelic variation of a polynucleotide sequence 
specified in (a) to (c)_encoding a polypeptide exhibiting the same 

! o biological function as specified for the respective sequence in Table 2 or 3; 

or the reagent of aspect 22 or 23 and a pharmaceutically acceptable carrier. ^ 
25. - A computer-readable medium comprising: 

a) at least one digitally encoded value representing a level of expression of at least 
one polynucleotide sequence of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 

15 26, 53 to 75 or 315 to 318 

b) al least 2 digitally encoded values representing the levels of expression of at least 
2 polynucleotide sequences selected from SEQ ID NO: 1 to 26, 53 to 75 or 315 to 
318 

in a cell from the a subject at risk for or having malignant neoplasia. 

20 26. A method for the detection of chromosomal alterations characterized in that the rela^ 
abundance of individual mKNAs, encoded by genes, located in altered chromosomal 
regions is detected. 

27. A method for the detection of chromosomal alterations characterized in that the copy 
number of one or more chromosomal region(s) is detected by quantitative PCR. 
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EXAMPLE 1 

Expression profiling 

a) Expression nrofilinz utilizmcr ^ vcmtitatiw RT.Pm 

For a detailed analysis of gene expression by quantitative PGR methods, one will utilize prim, 
flanking the genomic region of interest and a fluorescent labeled probe hybridizing in-betwe< 
Using the PRISM 7700 Sequence Detection System of PE Applied Biosystems (Perkin Elm, 
Foster City, CA, USA) with the technique of a fluorogenic probe, consisting of an oligonucleoti 
labeled with both a fluorescent reporter dye and a quencher dye, one can perform such 
expression measurement. Amplification of the probe-specific product causes cleavage of tl 
probe, generating an increase in reporter fluorescence. Primers and probes were selected using tl 
Primer Express software and localized mostly in the 3' region of the coding sequence or in the 
untranslated region (see Table 5 for primer- and probe- sequences) according to the relath 
positions of the probe sequence used for the construction of the Affymetrix HGU95 A-E or HC 
U133A-B DNA-chips. All primer pairs were checked for specificity by conventional PC] 
reactions. To standardize the amount of sample RNA, GAPDH was selected as a reference since 
was not differentially regulated in the samples analyzed. TaqMan validation experiments wer 
performed showing that the efficiencies of the target and the control amplifications ar. 
approximately equal which is a prerequisite for the relative quantification of gene expression * 
the comparative AACx method, known to those with skills in the art. 

As well as the technology provided by Perkin Elmer one may use other technique implementation, 
like Lightcycler ™ from Roche Inc. or iCycler from Stratagene Inc.. 

b) Expression nrnfili™ utilizing DN4 microarrays 

Expression profiling can bee carried out using the Afrymetrix Array Technology. By hybridization 
of mRNA to such a DNA-array or DNA-Chip, it is possible to identify the expression value of 
each transcripts due to signal intensity at certain position of the array. Usually these DNA-arrays 
are produced by spotting of cDNA, oligonucleotides or subcloned DNA fragments. In case of 
Aflymetrix technology app. 400.000 individual oligonucleotide sequences were synthesized on the 
surface of a silicon wafer at distinct positions. The minimal length of oligomers is 12 nucleotides 
preferable 25 nucleotides or full length of the questioned transcript. Expression profiling may also 
be carried out by hybridization to nylon or nitro-cellulose membrane bound DNA or 

oligonucleotides. Detection of signals derived from hvKri^^ u • , , 

m nybndization may be obtained by either 

colorimetric, fluorescent, electrochemical, electronic „_ , ^ 

^ironic, optic or by radioactive readout. Detailed 
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description of array construction have been mentioned above and in other patents cited. To 
determine the quantitative and qualitative changes in the chromosomal region to analyze, RNA 
from tumor tissue which is suspected to contain such genomic alterations has to be compared to 
RNA extracted from benign tissue (e.g. epithelial breast tissue, or micro dissected ductal tissue) on 
the basis of expression profiles for the whole transcriptome. With minor modifications, the sample 
preparation protocol followed the Affymetrix GeneChip Expression Analysis Manual (Santa 
Clara, CA). Total RNA extraction and isolation from tumor or benign tissues, biopsies, cell 
isolates or cell containing body fluids can be performed by using TRIzol (Life Technologies, 
Rockville, MD) and Oligotex mRNA Midi kit (Qiagen, Hilden, Germany), and an ethanol 
precipitation step should be carried out to bring the concentration to 1 mg/ml. Using 5-10 mg of 
mRNA to create double stranded cDNA by the Superscript system (Life Technologies). First 
strand cDNA synthesis was primed with a T7-(dT24) oligonucleotide. The cDNA can be extracted 
with phenol/chloroform and precipitated with ethanol to a final concentration of lmg /ml. From* 
the generated cDNA, cRNA can be synthesized using Enzo's (Enzo Diagnostics Inc., Farmingdale, 
NY) in viitro Transcription Kit. Within the same step the cRNA can be labeled with biotin 
nucleotides Bio-ll-CTP and Bio-16-UTP (Enzo Diagnostics Inc., Farmingdale, NY) . After 
labeling and cleanup (Qiagen, Hilden (Germany) the cRNA then should be fragmented in an 
appropriated fragmentation buffer (e.g., 40 mM Tris-Acetate, pH 8.1, 100 mM KOAc, 30 mM 
MgOAc, for 35 minutes at 94°C). As per the Affymetrix protocol, fragmented cRNA should be 
hybridized on the HG_U133 arrays A and B, comprising app. 40.000 probed transcripts each, for 
24 hours at 60 rpm in a 45°C hybridization oven. After Hybridization step the chip surfaces have 
to be washed and stained with streptavidin phycoerythrin (SAPE; Molecular Probes, Eugene, OR) 
in Affymetrix fluidics stations. To amplify staining, a second labeling step can be introduced, 
which is recommended but not compulsive. Here one should add SAPE solution twice with|| 
antistreptavidin biotinylated antibody. Hybridization to the probe arrays may be detected W 
fiuoromeiric scanning (Hewlett Packard Gene Array Scanner: Hewlett Packard Corporation, Palo 
Alto, CA). 

After hybridization and scanning, the microarray images can be analyzed for quality control, 
looking for major chip defects or abnormalities in hybridization signal. Therefor either Affymetrix 
GeneChip MAS 5.0 Software or other microarray image analysis software can be utilized. Primary 
data analysis should be carried out by software provided by the manufacturer.. 

In case of the genes analyses in one embodiment of this invention the primary data have been 
analyzed by further bioinformatic tools and additional filter criteria. The bioinformatic analysis is 
described in detail below. 
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c) Data analysis 



According to Affymetrix measurement technique (Asymetrix GeneChip Expression Analy: 
Manual, Santa Clara, CA) a single gene expression measurement on one chip yields the avera 
difference value and the absolute call. Each chip contains 16-20 oligonucleotide probe pairs p 
gene or cDNA clone. These probe pairs include perfectly matched sets and mismatched sets, bo 
of which are necessary for the calculation of the average difference, or expression value, 
measure of the intensity difference for each probe pair, calculated by subtracting the intensity', 
the mismatch from the intensity of the perfect match. This takes into consideration variability : 
hybridization among probe pairs and other hybridization artifacts that could affect the fluorescenc 
intensities. The average difference is a numeric value supposed to represent the expression vak 
of that gene. The absolute call can take the values 'A' (absent), <M* (marginal), or T> (presen 
and denotes the quality of a single hybridization. We used both the quantitative information give 
by the average difference and the qualitative information given by the absolute call to identify th 
genes which are differentially expressed in biological samples from individuals with breast cance 
versus biological samples from the normal population. With other algorithms than the Affymetri: 
one we have obtained different numerical values representing the same expression values an, 
expression differences upon comparison. 

The differential expression E in one of the breast cancer groups compared to the norma 
population is calculated as follows. Given n average difference values d„ d 2 , d, in the breas, 
cancer population and m average difference values c„ c 2 , .... c, in the population of normal 
individuals, it is computed by the equation: 

If dj<50 or c,-<50 for one or more values of i and j, these particular values c f and/or d } are set to an 
"artificial" expression value of 50. These particular computation of E allows for a correct 
comparison to TaqMan results. 

A gene is called up-regulated in breast cancer versus normal ifE>1.5 and if the number of absolute 
calls equal to 'P' in the breast cancer population is greater than n/2. 

A gene is called down-regulated in breast cancer versus normal if E^1.5 and if the number of 
absolute calls equal to 'P' in the normal population is greater than m/2. 
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The final list of differentially regulated genes consists of all up-regulated and all down-regulated 
genes in biological samples from individuals with breast cancer versus biological samples from 
the normal population. Those genes on this list which are interesting for a pharmaceutical 
application were finally validated by TaqMan. If a good correlation between the expression 
values/behavior of a transcript could be observed with both techniques, such a gene is listed in 
Tables 1 to 3. 

Since not only the information on differential expression of a single gene within an identified 
ARCHEON, but also the information, on the co-regulation of several members is important for 
predictive, diagnostic, preventive and therapeutic purposes we have combined expression data 
with information on the chromosomal position (e.g. golden path) taken from public available 
databases to develop a picture of the overall transcriptom of a given tumor sample. By this 
technique not only known or suspected regions of genomes can be inspected but even mo0 
valuable, new regions of disregulation with chromosomal linkage can be identified. This is of 
value in other types of neoplasia or viral -integration and chromosomal rearrangements. By SQL 
based database searches one can retrieve information on expression, qualitative value of a 
measurement (denoted by Affymetrix MAS 5.0 Software), expression values derived from other 
techniques than DNA-chip hybridization and chromosomal linkage. 

EXAMPLE 2 

Identificatinn nf the ARCHEON 

a) Identification and localization ofv e nes or vene probes (represented by the so called probe 
**t, on Affvmetr^ arrays HG-U W -E nr HG-U133A-B) in their chromosomal context and 
order on the human senome. ^ 

For identification of larger chromosomal changes or aberrations, as they have been described in 
detail above, a sufficient number of genes, transcripts or DNA-fragments is needed. The density of 
probes covering a chromosomal region is not necessarily limited to the transcribed genes, in case 
of the use of array based CGH but by utilizing RNA as probe material the density is given by the 
distance of genes on a chromosome. The DNA-microarrays provided by Affymetrix Inc. Do 
contain hitherto all transcripts from the known humane genome, which are be represented by 
40.000 - 60.000 probe sets. By BLAST mapping and sorting the sequences of these short DNA- 
oligomers to the public available sequence of the human genome represented by the so called 
"golden path", available at the university of California in Santa Cruz or from the NCBI, a 
chromosomal display of the whole Transcriptome of a tissue specimen evolves. By graphical 
display of the individual chromosomal regions and color coding of over or under represented 
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transcripts, compared to a reference transcriptome regions with DNA gains and losses can b 
identified. 

hi Quantification of een e copy numbers bv combined IHC and quantitative PCR (PCi 

karyotyping) or directly bv quantitative PCR 

Usually one to three paraffin-embedded tissue sections that are 5 \xm thick are used to obtaii 
genomic DNA from the samples. Tissue section are stained by colorimetric IHC afte 
deparaffinization to identify regions containing disease associated cells. Stained regions an 
macrodissected with a scalpel and transferred into a micro-centrifuge tube. The genomic DNA o: 
these isolated tissue sections is extracted using appropriate buffers. The isolated DNA is then usee 
for quantitative PCR with appropriate primers and probes. Optionally the IHC staining can be 
omitted and the genomic DNA can be directly isolated with or without prior deparaffinization with 
appropriate buffers. Those who are skilled in the art may vary the conditions and buffers described 
below to obtain equivalent results. 

Reagents from DAKO (HercepTest Code No. K 5204) and TaKaRa were used (Biomedicals Cat.: 
9091) according to the manufactures protocol. 

It is convenient to prepare the following reagents prior to staining: 
Solution No. 7 

Epitope Retrieval Solution (Citrate buffer + antimicrobial agent) (lOxconc.) 
20 ml ad 200 ml aqua dest. (stable for lmonth at 2-8°C ) 
Solution No. 8 

Washing-buffer (Tris-HCl + antimicrobial agent) (10 x cone.) 
30 ml ad 300 ml destilled water (stable for lmonth at 2-8°C ) 
Staining solution: DAB 

1 ml solution is sufficient for 10 slides. The solution were prepared immediately before usage.: 

1 ml DAB buffer (Substrate Buffer solution, pH 7.5, containing H 2 02, stabilizer, enhancers and an 
antimicrobial agent) + 1 drop (25-3 jil) DAB-Chromogen (3,3^diaminobenzidine chromogen 
solution). This solution is stable for up to 5 days at 2-8°C. Precipitated substances do not influence 
the staining result. Additionally required are:2 x approx. 100 ml Xylol, 2 x approx. 100 ml Ethanol 
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100%, 2 x Ethanol 95%, aqua dest These solution can be used for up to 40 stainings. A water bath 
is required for the epitope retrieval step. 

Staining procedure: 

All reagents are pre-warmed to room temperature (20-25°C) prior to immunostaining. Likewise all 
5 incubations were performed at room temperature. Except the epitope retrieval which is performed 
in at 95°C water bath. Between the steps excess of liquid is tapped off from the slides with lintless 
tissue (Kim Wipe), 

Deparaffinization 

Slides are placed in a xylene bath and incubated for 5 minutes. The bath is changed and the step 
10 repeated once. Excess of liquid is tapped off and the slides are placed in absolute ethanol foi^ 
minutes. The bath is changed and the step repeated once. Excess of liquid is tapped off and the 
slides are placed in 95% ethanol for 3 minutes. The bath is changed and the step repeated once. 
Excess of liquid is tapped off and the slides are placed in distilled water for a minimum of 30 
seconds. 

15 Epitope Retrival 

Staining jars are filled with with diluted epitope retrieval solution and preheated in a water bath at 
95°C. The deparaffinized sections are immersed into the preheated solution in the staining jars and 
incubated for 40 minutes at 95°C. The entire jar is removed from the water bath and allowed to 
cool down at room temperature for 20 minutes. The epitope retrieval solution is decanted, the 
20 sections are rinsed in distilled water and finally soaked in wash buffer for 5 minutes. 

Peroxidase Blocking: 

Excess of buffer is tapped off and the tissue section encircled with a DAKO pen. The specimen is 
covered with 3 drops (100 ^1) Peroxidase-Blocking solution and incubated for 5 minutes. The 
slides are rinsed in distilled water and placed into a fresh washing buffer bath. 

25 Antibody Incubation 

Excess of liquid is tapped off and the specimen are covered with 3 drops (100 of Anti-Her- 
2/neu reagent (Rabbit Anti-Human Her2 Protein in 0.05 mol/L Tris/HCl, 0.1 mol/L NaCl, 
15 mmol/L pH7.2 NaN 3 containing stabilizing protein) or negative control reagent (= IGG fraction 
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of normal rabbit serum at an equivalent protein concentration as the Her2 Ab). After 30 minutes c 
incubation the slide is rinsed in water and placed into a fresh water bath. 

Visualization 

Excess of liquid is tapped off and the specimen are covered with 3 drops (100 ^1) of visualizatioi 
reagent. After 30 minutes of incubation the slide is rinsed in water and placed into a fresh wate 
bath. Excess of liquid is tapped off and the specimen are covered with 3 drops (100 jil) o 
Substrate-Chromogen solution (DAB) for 10 minutes. After rinsing the specimen with distiller 
water, photographs are taken with a conventional Olympus microscope to document the staininj 
intensity and tumor regions within the specimen. Optionally a counterstain with hematoxylin wa: 
performed. 

DNA extraction 

The whole specimens or dissected subregions are transferred into a microcentrifuge tubes 
Optionally a small amount (10nl) of preheated TaKaRa solution (DEXPAT™) is preheated and 
placed onto the specimen to facilitate sample transfer with a scalpel. 50 to 150 |al of TaKaRa 
solution were added to the samples depending on the size of the tissue sample selected. The 
sample are incubated at 100°C for 10 minutes in a block heater, followed by centrifugation al 
12.000 rpm in a microcentrifuge. The supernatant is collected using a micropet and placed in a 
separate microcentrifuge tube. If no deparaffinization step has been undertaken one has to be sure 
not to withdraw tissue debris and resin. Genomic DNA left in the pellet can be collected by adding 
resin-free TaKaRa buffer and an additional heating and centrifugation step. Samples are stored at - 
20°C. 

Genomic DNA from different tumor cell lines (MCF-7, BT-20, BT-474, SKBR-3, AU-565, 
UACC-812, UACC-893, HCC-1008, HCC-2157, HCC-1954, HCC-2218, HCC-1937, HCC1599, 
SW480), or from lymphocytes is prepared with the QIAamp® DNA Mini Kits or the QIAamp® 
DNA Blood Mini Kits according to the manufacturers protocol. Usually between Ing up to l\ig 
DNA is used per reaction. 

Quantitative PCR 

To measure the gene copy number of Ihe genes within the patient samples the respective 
primer/probes (see table below) are prepared by mixing 25 nl of the 100 jiM stock solution 'Upper 
Primer", 25 jal of the 100 \xM stock solution •'Lower Primer" with 12,5 *il of the 100 \iM stock 
solution Taq Man Probe (Quencher Tamra) and adjusted to 500 nl with aqua dest. For each 
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reaction 1,25 jxl DNA-Extract of the patient samples or 1,25 nl DNA from the cell lines were 
mixed with 8,75 jil nuclease-free water and added to one well of a 96 Well-Optical Reaction Plate 
(Applied Biosystems Part No. 4306737). 1,5 \il Primer/Probe mix, 12, nl Taq Man Universal-PCR 
Mix (2x) (Applied Biosystems Part No. 4318157) and 1 nl Water are then added. The 96 well 
plates are closed with 8 Caps/Strips (Applied Biosystems Part Number 4323032) and centrifuged 
for 3 minutes. Measurements of the PCR reaction are done according to the instructions of the 
manufacturer with a TaqMan 7900 HT from Applied Biosystems (No. 20114) under appropriate 
conditions (2 min. 50°C, 10 min. 95°C, O.lSmin. 95°C, 1 min. 60°C; 40 cycles). SoftwareSDS 2.0 
from Applied Biosysrtems is used according to the respective instructions. CT-values are then 
further analyzed with appropriate software (Microsoft Excel™). 

EXAMPLE 3 




Clinical Samples of patients being treated with Herceptin and a chemotherapeutic agent (e.g. 
docetaxel, paclitaxel, taxotere, carboplatin, cisplatin, oxaliplatin, vinorelbine) as a second line 
therapy have been obtained. These samples included formalin-fixed and paraffin-embedded 
material from primary tumours and metastatic lesions of the respective patients. However, the 
determination of the ARCHEON genes as disclosed in this invention, has also been performed 
from fresh tissue after nucleic acid extraction in an independent, neoadjuvant setting. Moreover, 
whole blood, serum and plasma samples were available for multiple patients. 

Multiparametric, clinical assessment of the response to Herceptin in combination with 
chemotherapeutics (e.g. docetaxel, taxotere, paclitaxel, vinorelbine, carboplatin, cisplatin), or 
other therapies described below, was performed. Clinical information included histological 
parameters (TNM-Stage, AJCC grade), standard molecular markers (JHC staining for estrogen 
receptor, progesteron receptor, Her-2/neu) and sonographical or radiological assessment (e.g. 
Response to treatment was evaluated according to international standards, i.e. modified WHO 
criteria and RECIST criteria. Each cancer evaluation in the course of the disease was documented 
(method and date of evaluation, organ, anatomical description, measurability, size of lesion 
(longest diameter), greatest perpendicular diameter, tumor area). Moreover, each systemic 
anticancer therapy including prior chemotherapy with anthracyclins (Doxorubicin or Epirubicin) 
and/or CMF and the response thereto was evaluated (drug, intent, duration, schedule, number of 
cycles, cumulative dose). The response to combinatory treatment of metatstatic breast cancer 
patients with Herceptin and chemotherapeutica as second line treatment the modified WHO 
criteria were used. In addition the initial disease free survival, duration of response and time to 
progreesion were taken into consideration. For definition of treatment response standard criteria 
were used: „Complete Response" („CR" = tumor shrinkage of 100 % with no residual disease 
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being clinical detectable), partial Response" („PR" = tumor shrinkage of target lesion of at lc 
50%), ..Stable Disease" („SD" = tumor shrinkage of less than 50 % or no change) 5 
..Progressive Disease" („PD" = tumor growth or new tumor lesions). 

More than 70 genes were analyzed according to the method disclosed in example 2 by combir 
IHC and quantitative PCR or directly by quantitative PCR after nucleic acid extraction from , 
formaldehyde-fixed, paraffin-embedded tissue slides. Results were reconfirmed by independ, 
methodology (VNTR and SNP detection). Alterations of the 43 ARCHEON genes w« 
determined by comparison with reference genes, that are located on the same chromosome 
intrachromosomal control,) or different chromosomes (= extrachromosomal contro 
Intrachromosomal reference genes included MMP28, hKa3 and K20. Extrachromosomal referen 
genes included GAPDH for chromosome 12. However any other gene not included in tl 
ARCHEONs disclosed in this invention can be used as reference gene for ARCHEO 
characterizetion. The reference genes should be independent from the ARCHEON alteratio, 
occuring in the neoplastic lesions and should be not affected by chromosomal alterations such ■ 
amplifications and deletions. As gene copy numbers of non-amplified genes can be increased i 
neoplastic lesions due to genomic imbalances such as aneuploidie or polyploid eac 
measurement of ARCHEON genes was correlated to multiple reference genes to minimize th 
influence of genomic imbalances on the relative copy number calculation. Moreover, mine 
systemic errors occuring due to differences in the performance of individual primer/probe pair 
were minimized by determining primer/probe performances in control tissues (i.e. non-neoplasti 
tissues from healthy controls) and euploid control cell lines (e.g. HS68, ATCC #CRL1635) 
Moreover one well charcterized, control cell line was used, that displays aneuploidie for a singl, 
chromosome (i.e. Detroit, ATCC#CCL-54; trisomie 21). By measuring genes located on the X 
chromosome (e.g. SRY), the Y-chromosome (e.g. Xist) and on chromosome 21, defined copj 
numbers of 1, 2 and 3 genes could be determined as internal control during each run fo, 
standardization. In addition, synthetic targets were spiked into some reactions, that consisted of the 
target region of the PCR forward and reverse primers of the gene to be normalized, but in between 
consisted of a synthetic probe hybridization region different from the original probe region of the 
target gene to be normalized. This allowed internal standardization of each individual qPCR 
reaction by multiplex PCR. The calculated performance differences were used as a filter for the 
measurements within the target tissues, i.e. primer/probe differences of each individual gene as 
depicted in the control cells and tissues were subtracted from each individual gene measurement 
performed in the target tissue. Thereafter, the individual, filtered CT values were normalized to the 
different reference genes. Differences between the CT values of the quantitative PCR reactions of 
the ARCHEON genes and the reference genes remaining after filtering the primer/probe 
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performance differences were determined and transformed into „copy numbers per cell". This was 
done by subtracting the CT values of the target genes from the CT values of the reference genes. 
The resulting ACT values were then transformed in gene copy numbers, with the ACT value of the 
reference gene (ACT=0) being defined as „2 copies per cell", by the following formula: 
2*(2 A ( ACT*(-1))). All the calculations were done using standard software (Microsoft ExcelTM). 
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Table 1 



DNA 
SEQ ID NO: 


Protein 
SEQ ID NO: 


Cenbank ID 


Unigene_vl 62_ID 


Locus Link ID 


Gene Name 


1 


27 


NM_006148.1 


Hs.334851 


3927 


LASP1 


2 


28 


NM_000723.1 


Hs.635 


782 


CACNB1 


3 


29 


NM_000981.1 


Hs.381061 


6143 


RPL19 


4 


30 


Y 13467 


Hs. 15589 


5469 


PPARGBP 


5 


31 


NM_016507.1 


Hs.416108 


51755 


CrkRS/CRK7 


6 


32 


AB02 1742.1 


Hs:322431 


4761 


NEUROD2 


7 


33 


NM_006804.1 


Hs.77628 


10948 




8 


34 


NM_003673.1 


Hs.343603 


8557 


TELETHONIN 


9 


35 


NM_002686.1 


Hs.1892 


5409 


PNMT 


10 


! 36 


X03363.1 


Hs.446352 


2064 


ERBB2 


11 


37 


AB008790.1 


Hs.86859 


2886 


VJXxJD / • 


12 


38 


NM_002 809.1 


Hs.9736 


5709 


JrolVJJLO 


13 


39 


NM_000759. 1 


Hs.2233 


1440 




14 


40 


AI023317 
NM_014815 


Hs.23106 


9862 


TRAP 100 


15 


41 


X55005 


Hs.724 


7067 


* 


16 


42 


X72631 


Hs.2769166 


9572 


NR1D1 


17 


43 


NM_007359.1 


Hs.83422 


22794 


lYlLiliJ 1 


18 


44 


U77949.1 


Hs.405958 


990 




19 


45 


U41742.1 
NM_000964 


Hs.361071 


5914 


PAD A 
XN-rVlVrV 


20 


46 


NM_00 1067.1 


Hs. 156346 


7153 


TOP2A 


21 


47 


NM_00 1552.1 


Hs.1516 


3487 


IGFBP4 


22 


48 


NM_00 1838.1 


Hs.1652 


1236 


CCR7 EBI1 


23 


49 


NM_003079.1 


Hs.437546 


6605 


SMARCE1 BAF57 


24 


50 


X14487 


Hs.99936 


3858 


KRT10 


25 


51 


NM_000223.1 


Hs.66739 


3859 


KRT12 


26 


52 ] 


NM_002279.2 


Hs.32950 


3884 


/KRTHA3B 


53 


76 


NM_005937 


Hs.497128 


4302 


MLLT6 
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Table 1 (continued) 



DNA 
SEQ ID NO: 


Protein 
SEQ ID NO: 


Genbank ID 


Unigene_vl62JD 


Locus Link ID 


Gene Name 


54 


77 


XM 008147/ 
NM_007144 


Hs.371617 


7703 


ZNF144/RNF110 


55 


78 


NM_1 38687 


Hs.9605 


8396 


PIP5K2B 


56 


79 


NM_020405 


Hs.125036 


57125 


TEM7/PLXDC1 


57 


80 


AF129512 


Hs.258579 


22806 


ZNFN1A3 


58 


81 


XM_085731 
NM_1 33264 


Hs.421622 


147179 


WIRE 


59 


82 


NM_O02795 


Hs.82793 


5691 


PSMB3 


60 


83 


NM 033419 


Hs.91668 


93210 


MGC9753 
Variant a 
/CAB2 


61 


84 


NM_033419 


Hs.91668 


93210 


MGC9753 
Variant c 


62 


85 


NM_033419 


Hs.91668 


93210 


MGC9753 
Variant d 


63 


86 


NM_033419 


Hs.91668 


93210 


MGC9753 
Variant e 


64 


87 


NM_033419 


Hs.91668 


93210 


MGC9753 
Variant g 


65 


88 


NM.033419 


Hs.91668 


93210 


MGC9753 
Variant h 


66 


89 


NM_033419 


Hs.91668 


93210 


MGC9753 
Variant i 


67 


90 


AF395708 


Hs.133167 


94103 


ORMDL3 


68 


91 


NM_032873 


I JtlS. lV*H*iro 




MGC15482 


69 


92 


NMJ)32192 


Hs.286192 


84152 


PPP1R1B 


70 


93 


NMJB2339 


Hs.333526 


84299 


MGC14832 


71 


94 


NM_057555 
NM_139280 


Hs.133167 


51242 


LOC51242 
/ORMDL3 


72 


95 


NMJ)17748 


Hs.406223 


54883 


FLJ20291 


73 


96 


NM_018530 


Hs.306777 


55876 


Pro2521 


74 


97 


NMJ)16339 


Hs. 158530 


51195 


Link-GEFU 
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Table 1 (continued) 



DNA 
SEQ ID NO: 


Protein 
SEQ ID NO: 


Genbank ID 


Unigene_vl62 ID 


LiOcus I jink ID 


fvPftA Noma 

went; iodine 


75 


98 


NMJ)32865 


Hs.99037 


84951 


CTEN 


315 


393 


XM_294897 


Hs.270564 


30837 


NAP4 


316 


394 


NM.032351 


Hs. 19347 


84311 


MRLP45 


317 


395 


NM_000458 


Hs.408093 


6928 


TCF2 


318 


396 


NMJ52300 


Hs.380430 


11056 


ROK1 

•IVV^XV 1 


319 


397 


NM019010 


Hs.84905 


54474 


XVXV X £.\J 


320 


398 


NM_173213 


Hs.9029 


25984 


rvxv i ^ j 


321 


399 


NMJ)33185 


Hs.307025 


85293 


XVXV A /\Tj"J 


322 


400 


NM_031959 


Hs.3 07026 


83897 


Xvxv x ATj-Z 


323 


401 


NG_000941 




85345 


1CRTAP1P1 

Xvxv l r\Jrjtr x 


324 


402 


NM_031958 


Hs.307027 


83896 


KRTAP1-1 


325 


403 


NMJ)31957 


Hs.307030 


83895 


xvjv x r\JT x -J 


326 


404 


NM_030966 


Hs.247935 


81850 


1CRTAP1 X 
xvJv 1 r\x 1 —j 


327 


405 


NM_030967 


Hs.247934 


81851 


1CRTAP1-1 
xvxv i /\r i 


328 


406 


AJ302536 




85296 


liTPTAP? 9 
xvXv I ATZ-Z - 


329 


407 


NMJ)33184 




85294 


xvxv 1 ArZ-*t - 


330 


408 


NGJ)00939 




85343 


OTAPOPI 
XVxv 1 AfZi 1 


331 


409 


NM_033061 


Hs.380164 


85287 


KRTAP4-7 


332 


410 


NM 033059 


Hs.307015 


85282 


KRTAP4-14 


333 


411 ; 


NM_031854 


Hs.307016 


83755 


KRTAP4-19 


334 


412 


NMJ)33188 


Hs.307016 


83755 


KRTAP4-5 


335 


413 


NM_033186 




85283 


KRTAP4-13 


336 


414 


NM 032524 


Hs.307022 


84616 


KRTAP4-4 


337 


415 


NMJ)33062 


Hs.380165 


85291 ! 


KRTAP4-2 


338 


416 


NMO33060 


Hs.380165 


85291 


KRTAP4-10 


339 


417 


NM 031961 


Hs.307013 


83899 


KRTAP9-2 


340 


418 


NM_031962 


Hs.307012 


83900 


KRTAP9-3 


341 


419 


NMJ)31963 


Hs.307011 


83901 


KRTAP9-8 


342 


420 


NMJ)30975 


Hs.307010 


81870 


KRTAP9-9 


343 


421 


NM_033191 




85280 


KRTAP9-4 


344 


422 


NGJ)00942 




85347 


KRTAP9P1 


345 


423 


XM_2 10345 


Hs.463016 


85276 


KRTAP16-1 


346 


424 


NMJ>31964 


Hs.307009 


83902 


KRTAP17-1 



BHC03 1 051 

-144- 

Table 1 (continued) 



cpn in no* 


m ruicm 
SEO ID NO: 


f?pnhnnk ID 


Unipene vl62 ID 


Locus Link ID 


Gene Name 


347 


425 


NM 004138 


Hs. 197874 


3883 


KRTHA3A 


34R 


426 


NM 002279 


Hs.32950 


3884 


KRTHA3B 


349 


427 


NM 021013 


Hs.296942 


3885 


KRTHA4 


3^0 


428 


NM 002277 


Hs.41696 


3881 


KRTHA1 


3^1 


410 ! 


Y 16795 




8686 


KRTHAP1 


7^9 


430 


NM 003770 


Hs 159403 


8688 


KRTHA7 




431 

*T J 1 


NM 006771 


Hs 248188 


8687 


KRTHA8 


354 


412 


NM 002278 

1NJ.V.I. VV/fcAt r O 


Hs 41752 


3882 


KRTHA2 


JJJ 


433 


NM 002280 


Hs 73082 


3886 


KRTHA5 


**5fi 


434 


NM 003771 


Hs 248189 


8689 


KRTHA6 


"*57 


435 


NM 002274 

IN 1V1 l/V/*»^» / "T 


Hs 433871 


3860 


KRT13 


**58 


43 


NM 002 27 S 


Hs 80342 


3866 


KRT15 


**5q 


437 


NM 002276 


Hs 309517 


3880 


KRT19 




43R 


NM 000226 


Hs 2783 


3857 


KRT9 




A3Q 


NM 000596 


H«; 355214 


3861 


KRT14 






NM 00SS57 


Hs 432448 


3868 


KRT16 


J 00 


A41 


NM O00499 


Hs.2785 


3872 


KRT17 




449 


NM 005556 


Hs 23881 


3855 


KRT7 v 


**A5 


443 


NG 000944 




85349 


KRTHBP4 




444 


NG 000943 




85348 


KRTHBP3 


JO/ 


445 

'rtJ 


NM 002281 

1NIYX Wa*0 1 


Hs 170925 


3887 


KRTHB1 


j Do 


AAA 


NM 002284 

AN1VX \J\J£m fcU7 


Hs-278658 


3892 


KRTHB6 


IXO 
JV7 


447 


NM 002282 


Hs. 182506 


3889 


KRTHB3 


■j t\j 


448 


NG 000940 




85344 


KRTHBP2 


371 


449 


NM 002283 


Hs. 182507 


3891 


KRTHB5 


372 


450 


NM_033045 


Hs.272336 


3890 


KRTHB4 


373 


451 


NM_033033 


Hs. 134640 


3888 


KRTHB2 ! 


374 


452 


Y19213 




85340 


KRTHBP1 


375 


453 


NM_005555 


Hs.432677 


3854 


KRT6B 


376 


1 454 


NM_173086 


Hs.446417 


286887 


KRT6E 


377 


455 


NM_058242 




140446 


KRT6C 


378 


456 


NM_005554 


Hs.367762 


3853 


KRT6A 


379 


457 


NM_000424 


Hs.433845 


3852 


KRT5 
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Table 1 (continued) 



DNA 
SEQ ID NO: 


r i uiciu 

SEO ID NO: 


uenoank ID 


Unigene_v 1 62_ID 


Locus Link ID 


Gene Name 


380 


458 


1 T 1VA VJ J*T*T O 


' Tic <<770 


112802 


KRT6IRS 


381 


459 


rN ivi 1 / jUjj 


tt_ <r<*x< 


121391 


KRT6IRS4 


382 


460 


rMlVl^Uol/ IH // 

AYOTVtQS 


US. 1471)40 


140807 


K6IRS2/ 
KRT6 


383 


461 




TT- T 1 HI A1 

ris.iiyioi 


55410 


KRT6IRS3 


384 


462 


1^1 JLVl \J\J\J t t£.j 


TT- *7A*7 

lis. /U7 


3849 


KRT2A 


385 


463 




TT- OAOIO 

rls. 80828 


3848 


KRT1 


386 


464 


NM_057088 


Hs.410397 


3850 




387 


465 


NM_002272 


Hs.371139 


3851 


KRT4 


388 


466 


NMJ)02273 


Hs.356123 


3856 


KRT8 


389 


467 


NMJ)00224 


Hs.406013 


3875 


KRT18 


390 


468 


NMJ)32950 


Hs.380710 


79148 


MMP28 


391 


469 


NMJ)05419 


Hs.72988 


6773 


STAT2 


392 


470 


NMJ)02046 


Hs. 169476 


2597 


GAPDH 
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Table 2 



TINA 

SEQ ID NO: 


Gene description 


1 

j 


Member of a subfamily of LIM proteins that contains a LIM domain and an 
5H3 (Src homology region 3) domain 


2 

i 


Beta 1 subunit of a voltage-dependent calcium channel (dihydropyridine 
receptor), involved in coupling of excitation and contraction in muscle, also 
acts as a calcium channel in various other tissues 


3 


Ribosomal protein LI 9, component of the large 60S ribosomal subunit 


4 


Protein with similarity to nuclear receptor-interacting proteins; binds and co- 
activates the nuclear receptors PPARalpha (PPARA), RARalpha (RARA), 
RXR, TRbetal, and VDR 


5 


we26e02.xl CDC2-related protein kinase 7 


6 


Neurogenic differentiation, a basic-helix-loop-helix transcription factor that 
mediates neuronal differentiation 


7 


Protein that is overexpressed in malignant tissues, contains a putative trans- 
membrane region and a StAR Homology Domain (SHD), may function in 
steroidogenesis and contribute to tumor progression 


8 


Telethonin, a sarcomeric protein specifically expressed in skeletal and heart 
muscle, caps titin (TTN) and is important for structural integrity of the 
sarcomere 


9 


Phenylethanolamine N-methyltransferase, acts in catecholamine biosynthesis 
to convert norepinephrine to epinephrine 


10 


Tyrosine kinase receptor that has similarity to the EGF receptor, a critical 
component of IL-6 signaling through the MAP kinase pathway, overexpression 
associated with prostate, ovary and breast cancer 


11 


Growth factor receptor-bound protein, an SH2 domain-containing protein max 
has isoforms which may have a role in cell invasion and metastatic progression 
of esophageal carcinomas 


! 12 


Non-ATPase subunit of the 26S proteasome (prosome, macropain) 


1 13 


Granulocyte colony stimulating factor, a glycoprotein that regulates growth, 
differentiation, and survival of neutrophilic granulocytes 
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Table 2 (continued) 



DNA 
SEQ ID NO: 



14 



15 



16 



17 
IF 



20 



21 



22 



23 



53 



Gene description 



Member of the Vitamin D Receptor Interacting Protein co-activator complex, 
has strong similarity to thyroid hormone receptor-associated protein (murine] 
TraplOO) which function as a transcriptional coregulator 



Thyroid hormone receptor alpha, a high affinity receptor for thyroid hormone 
that activates transcription; homologous to avian erythroblastic leukemia virus) 
oncogene 



encoding Kev-ErbAalp nuclear receptor subfamily 1, group D, member 1 



that is overexpressed in breast carcinomas 
Protein which interacts with the DNA replication proteins PCNA and Orel J 
translocates from the nucleus following onset of S phase; S. cerevisiael 
homolog Cdc6p is required for initiation of S phase 



Retinoic acid receptor alpha, binds retinoic acid and stimulates transcription in 
a ligand-dependent manner 



DNA topoisomerase H alpha, member of a family of proteins that relieves| 
torsional stress created by DNA replication, transcription, and cell division;. 



Insulin-like growth factor binding protein, the major IGFBP of osteoblast-like", 
cells, binds IGF1 and IGF2 and inhibits their effects on promoting DNA andj 
glycogen synthesis in osteoblastic cells 



HUMEBI103 G protein-coupled receptor (EBI 1) gene exon 3 chemokine (C-Cj 
motif) receptor 7 G protein-coupled 
Protein with an HMG 1/2 DNA-binding domain that is subunit of the 
SNF/SWI complex associated with the nuclear matrix and implicated in 
regulation of transcription by affecting chromatin structure 




iair keratin 3B, a type I keratin that is a member of a family of structural 
proteins that form intermediate filaments 



MLLT6 Myeloid/lymphoid or mixggge i eukemia (trithorax homolog 



BHC03 1 051 

-148- 



Table 2 (continued) 



DNA 
SEQ ID NO: 


VJVlIt Uvdtl ipuuu j 




)rosophila); translocated to, 6 1 


54 z 


inc finger protein 1 44 (Mel- 1 8) j 


55 F 


»hosphatidylinositol-4-phosphate 5-kinase type n beta isoform a 


56 t 


umor endothelial marker 7 precursor J 


57 2 


sine finger protein, subfamily 1 A, 3 1 


58 l 

i 


ASP-binding protein putative crl6 and wip like protein similar to Wiskott-| 
Mdrich syndrome protein ! 


59 1 


Proteasome (prosome, macropain) subunit, beta type, 3 J 


60 1 


Predicted ] 


67 


ORMl-like 3 (S. cerevisiae) _J 


68 


F-box domain A Receptor for Ubiquitination Targets j 


69 


protein phosphatase 1, regulatory (inhibitor) subunit IB (dopamine and cAMP 
regulated phosphoprotein, DARPP-32) 


70 


Predicted Protein 


71 


Predicted Protein J 


72 


Predicted Protein 


73 


Predicted Protein 1 


74 


Link-GEFH: Link guanine nucleotide exchange factor II j 


75 


C-terminal tensin-like j 


315 


Homo sapiens Nek, Ash and phosphohpase C binding protein (NAP4) 


316 


Homo sapiens mitochondrial ribosomal protein L45 (MRPL45), nuclear gene 
encoding mitochondrial protein 


317 


Homo sapiens transcription factor 2, hepatic; LF-B3; variant nepanc nuciear 
factor (TCF2), transcript variant a 


318 


Homo sapiens DEAD (Asp-Glu-Ala-Asp) box polypeptide 52 (DDX52) 


319 


Homo sapiens keratin 20 (KRT20), ,is a component of intermediate filament 
network 


320 


Homo sapiens keratin 23 (histone deacetylase inducible) tKKizjj, is a 
component of intermediate filament network transcript variant 2 


321 


Homo sapiens keratin associated protein 3-3 (KRTAP3-3) ), is a component of 
intermediate filament network 


322 


Homo sapiens keratin associated protein 3-2 (KRTAP3-2) ,is a component of 
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Table 2 (continued) 



DNA 
SEQ ID NO: 


Gene description 


i 


intermediate filament network 


323 


Homo sapiens keratin associated protein 3 pseudogene 1 (KRTAP3P1) on 
chromosome 17 ,is a component of intermediate filament network 


324 


Homo sapiens keratin associated protein 3-1 (KRTAP3-1) ,is a component of 
intermediate filament network 


325 


Homo sapiens keratin associated protein 1-5 (KRTAP1-5) ,is a component of 
intermediate filament network 


326 


Homo sapiens keratin associated protein 1-3 (KRTAPi-3) ,is a component of 
intermediate filament network 


327 


Homo sapiens keratin associated protein 1-1 (KRTAP1-1) ,is a component of 
intermediate filament network 


328 


HSA302536 Homo sapiens partial mRNA for keratin associated protein 
KAP2.2 (KRTAP2.2 gene) ,is a component of intermediate filament network 


329 


Homo sapiens keratin associated protein 2-4 (KRTAP2-4) ,is a component of 
intermediate filament network 


330 


Homo sapiens keratin associated protein 2 pseudogene 1 (KRTAP2P1) on 
chromosome 1 7,is a component of intermediate filament network 


331 


Homo sapiens keratin associated protein 4-7 (KRTAP4-7) ,is a component of 
intermediate filament network 


332 


Homo sapiens keratin associated protein 4-14 (KRTAP4-14) ,is a component 
of intermediate filament network 


333 


Homo sapiens keratin associated protein 4-12 (KRTAP4-12) ,is a component 
of intermediate filament network 


334 


Homo sapiens keratin associated protein 4-5 (KRTAP4-5) ,is a component of 
intermediate filament network 


335 


Homo sapiens keratin associated protein 4-13 (KRTAP4-13) ,is a component 
of intermediate filament network 


336 


Homo sapiens keratin associated protein 4-4 (KRTAP4-4) ,is a component of 
intermediate filament network 


337 


Homo sapiens keratin associated protein 4-2 (KRTAP4-2) ,is a component of 
intermediate filament network 


338 


Homo sapiens keratin associated protein 4-10 (KRTAP4-10) ,is a component 
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Table 2 (continued) 



DNA 
SEQ E> NO: 


Gene description 




of intermediate filament network 


339 


Homo sapiens keratin associated protein 9-2 (KRTAP9-2) ,is a component of 
intermediate filament network 


340 


Homo sapiens keratin associated protein 9-3 (KRTAP9-3) ,is a component of 
intermediate filament network 


341 


Homo sapiens keratin associated protein 9-8 (KRTAP9-8) ,is a component of 
intermediate filament network 


342 


Homo sapiens keratin associated protein 9-9 (KRTAP9-9) ,is a component of 
intermediate filament network 


343 


Homo sapiens keratin associated protein 9-4 (KRTAP9-4) ,is a component of 
intermediate filament network 


344 


Homo sapiens keratin associated protein 9 pseudogene 1 (KRTAP9P1) on 
chromosome 17 ,is a component of intermediate filament network 


345 


Homo sapiens keratin associated protein 16-1 (KRTAP16-1) ,is a component 
of intermediate filament network 


346 


Homo sapiens keratin associated protein 17-1 (KRTAP17-1) ,is a component 
of intermediate filament network 


347 


Homo sapiens keratin, hair, acidic, 3A (KRTHA3A) ,is a component of 
intermediate filament network 


348 


Homo sapiens keratin, hair, acidic, 3B (KRTHA3B) ,is a component of 
intermediate filament network 


349 


Homo sapiens keratin, hair, acidic, 4 (KRTHA4) ,is a component of 
intermediate filament network 


350 


Homo sapiens keratin, hair, acidic, 1 (KRTHA1) ,is a component of 
intermediate filament network 


351 


HSA16795 Homo sapiens KRTHAP1 pseudogene,is a component of 
intermediate filament network 


352 


tiomo sapiens keratin, hair, acidic, 7 (KRTHA7) ,is a component of 
intermediate filament network 


353 ] 
i 


Homo sapiens keratin, hair, acidic, 8 (KRTHA8) ,is a component of 
ntermediate filament network 


354 ] 


Somo sapiens keratin, hair, acidic, 2 (KRTHA2) ,is a component of 
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Table 2 (continued) 



DNA 
SEQ ID NO: 


Gene description 




intermediate filament network 


355 


Homo sapiens keratin, hair, acidic, 5 (KRTHA5) ,is a component of 
intermediate filament network 


356 


Homo sapiens keratin, hair, acidic, 6 (KRTHA6) ,is a component of 
intermediate filament network 


357 


Homo sapiens keratin 13 (KRT13), transcript variant 2,is a component of 
intermediate filament network 


358 


Homo sapiens keratin 15 (KRT15) ,is a component of intermediate filament 
network 


359 


Homo sapiens keratin 19 (KRT19) ,is a component of intermediate filament 
network 


360 


Homo sapiens keratin 9 (epidermolytic palmoplantar keratoderma) (KRT9) ,is 
a component of intermediate filament network 


361 


Homo sapiens keratin 14 (epidermolysis bullosa simplex, Dowling-Meara, 
Koebner) (KRT14) ,is a component of intermediate filament network 


362 


Homo sapiens keratin 16 (focal non-epidermolytic palmoplantar keratoderma) 
(KRT16) ,is a component of intermediate filament network 


363 


Homo sapiens keratin 17 (KRT17) ,is a component of intermediate filament 
network 


364 


Homo sapiens keratin 7 (KRT7) ,is a component of intermediate filament 
network 


365 


Homo sapiens psihHbD hair keratin pseudogene (KRTHBP4) on chromosome 
12,is a component of intermediate filament network 


366 


Homo sapiens psihHbC hair keratin pseudogene (KRTHBP3) on chromosome 
12,is a component of intermediate filament network 


367 


Homo sapiens keratin, hair, basic, 1 (KRTHB1) ,is a component of 
intermediate filament network 


368 


Homo sapiens keratin, hair, basic, 6 (monilethrix) (KRTHB6) ,is a component 
of intermediate filament network 


369 


Homo sapiens keratin, hair, basic, 3 (KRTHB3) ,is a component of 
intermediate filament network 


370 


Homo sapiens psihHbB hair keratin pseudogene (KRTHBP2) on chromosome 
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Table 2 (continued) 



DNA 
SEQ ED NO: 


Gene description 


1 


12 ,is a component of intermediate filament network 


371 1 

i 


HLomo sapiens keratin, hair, basic, 5 (KRTHB5) ,is a component of 
ntermediate filament network 


372 1 


Homo sapiens keratin, hair, basic, 4 (KRTHB4), ,is a component of 
intermediate filament network 


373 


Homo sapiens keratin, hair, basic, 2 (KRTHB2) ,is a component of 
intermediate filament network 1 


374 


HSPSIHHBA Homo sapiens putative psihHbA pseudogene for hair keratin, 
exons 2 to 7 


375 


Homo sapiens keratin 6B (KRT6B) ,is a component of intermediate filament 
network 


376 


Homo sapiens keratin 6E (KRT6E),is a component of intermediate filament 
network 


377 


Homo sapiens keratin 6C (KRT6C) ,is a component of intermediate filament 
network 


378 


Homo sapiens keratin 6A (KRT6A), ,is a component of intermediate filament 
network 


379 


Homo sapiens keratin 5 (epidermolysis bullosa simplex, Dowhng- 
Meara/Kobner/Weber-Cockayne types) (KRT5) ,is a component of 
intermediate filament network 


380 


Homo sapiens keratin 6 irs (KRT6IRS) ,is a component of intermediate 
filament network 


381 


Homo sapiens keratin 6 irs4 (K61KS4) ,is a component of intermediate 
filament network 


382 


Homo sapiens keratin protein K6irs (K6IRS2) ,is a component of intermediate 
filament network 


383 


Homo sapiens keratin protein K6irs (K6IRS2) ,is a component of intermediate 
filament network 


384 


Homo sapiens keratin 2A (epidermal ichthyosis bullosa of Siemens) (KRT2A) 
,is a component of intermediate filament network 


385 


Homo sapiens keratin I (epidermolytic hyperkeratosis) (KRT1) ,is a 
component of intermediate filament network 
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Table 2 (continued) 



I DNA 
SEQ ID NO: 


Gene description 

VVMV wVMVl IIJ*II/U 


386 


Homo sapiens keratin 3 (KRT3) ,is a component of intermediate filament 
network 


387 


Homo sapiens keratin 4 (KRT4) ,is a component of intermediate filament 
network 


388 


Homo sapiens keratin 8 (KRT8) ,is a component of intermediate filament 
network 


389 


Homo sapiens keratin 18 (KRT18) ,is a component of intermediate filament 
network 


390 


Homo sapiens matrix metalloproteinase 28 (MMP28), transcript variant 2 


391 


Homo sapiens signal transducer and activator of transcription 2, 113kDa 
(STAT2) 


392 


Homo sapiens glyceraldehyde-3-phosphate dehydrogenase (GAPD) 
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Subcellular localization 


nucleus 


nucleus 




plasma membrane 


nucleus nuclear chromosome 


cytoplasm 


cytoplasm 


cytoplasm 


i 




1 




■ 






1 


i 


• 


Gene function 


retinoic acid receptor transcription co-activator transcription factor 


DNA binding DNA topoisomerase (ATP-hydrolyzing) 


skeletal development DNA metabolism signal transduction cell proliferation 




chromatin binding transcription co-activator nucleosome disassembly transcription 


Cell structure Cytoskeletal Epidermal Development and Maintenance 


structural protein vision cell shape and cell size control intermediate filament 


cell shape and cell size control Cell structure 






leucine-zipper containing fusion 




Tumor endothelial marker 7 precursor; may be involved in angiogenesis 


Aiolos; DNA binding protein that may be a transcription factor; has strong similarity to murine 
Znfola3, contains zinc finger domain 


The WASP-binding protein WIRE has a role in the regulation of the actin filament system downstream 
of the platelet-derived growth factor receptor 
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cellular localiz 




Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


cytoplasm 


cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


cytoplasm 


cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Cytoplasm 


Sub 












































Gene function 


KRT14, integral part of the intermediate filamentous network 


KRT16, integral part of the intermediate filamentous network 


KRT17, integral part of the intermediate filamentous network 


KRT7, integral part of the intermediate filamentous network 


KRTHBP4, integral part of the intermediate filamentous network 


KRTHBP3, integral part of the intermediate filamentous network 


KRTHBl, integral part of the intermediate filamentous network 


KRTHB6, integral part of the intermediate filamentous network 


KRTHB3, integral part of the intermediate filamentous network 


KRTHBP2, integral part of the intermediate filamentous network 


KRTHB5, integral part of the intermediate filamentous network 


KRTHB4, integral part of the intermediate filamentous network 


KRTHB2, integral part of the intermediate filamentous network 


KRTHBPl, integral part of the intermediate filamentous network 


KRT6B, integral part of the intermediate filamentous network 


KRT6E, integral part of the intermediate filamentous network 


KRT6C, integral part of the intermediate filamentous network 


KRT6A, integral part of the intermediate filamentous network 


KRT5, integral part of the intermediate filamentous network 


KRT6IRS, integral part of the intermediate filamentous network ." . 
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Table 4 



DNA 
SEQID 
NO: 


Protein 
SEQID 
NO: 


Gene Name 


: DBSNP ID 


Type 


Codon 


AA-Seq 


9 


34 


ERBB2 


" rs2230698 


coding-synon 


TCA|TCG 


SIS 


9 


34 


ERBB2 


rs2230700 


noncoding 






9 


34 


ERBB2 


rsl058808 


coding- 
nonsynon 


CCC|GCC 


P|A 


9 


34 


ERBB2 


rsl801200 


noncoding 






9 


34 


ERBB2 


rs903506 


noncoding 






9 


34 


ERBB2 


rs2313170 


noncoding 






9 


34 


ERBB2 


rsl 136201 


coding- 
nonsynon 


ATC|GTC 


IIV 


9 


34 


ERBB2 


rs2934968 


noncoding 






9 


34 


ERBB2 


rs2 172826 


noncoding 






9 


34 


ERBB2 


rsl810132 


coding- 
nonsynon 


ATC|GTC 


I|V 


9 


34 


ERBB2 


rsl801201 


noncoding 






14 


39 


c-erbA-1 


rs2230702 


coding-synon 


TCC|TCT 


SIS 

1 


14 


39 


c-erbA-1 


rs2230701 


coding-synon 


GCC|GCT 




14 


39 


c-erbA-1 


rsl 126503 


coding- 
nonsynon 


ACC|AGC 


TIS 


14 


39 


c-erbA-1 


rs3471 


noncoding 






19 


44 


TOP2A 


rsl3695 


noncoding 






19 


44 


TOP2A 


rs471692 


noncoding 






19 


44 


TOP2A 


rs558068 


noncoding 






19 


44 


TOP2A 


rsl 064288 


noncoding 






19 


44 


TOP2A 


rsl061692 


coding-synon 


GGA|GGG 


GIG 


19 


44 


TOP2A 


rs520630 


noncoding 






19 


44 


TOP2A 


rs782774 


coding- 
nonsynon 


AAT|ATT|A 
TT|TTT 


N|I|I|F 


19 


44 


TOP2A 


rs565121 


noncoding 






19 


44 


TOP2A 


rs25861 12 


noncoding 






19 


44 


TOP2A 


rs532299 


coding- 
nonsynon 


TTT|GTT 


F|V 
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Table 4 (continued) 



DNA 
SEQID 
NO: 


Protein 
SEQID 
NO: 


Gene Name 


DBSNP ID 


lype 




AA-Sea 


19 


44 


TOP2A 


rs2732786 


noncoding 






19 


44 


TOP2A 


rs 180453 9 


noncoding 






19 


44 


TOP2A 


rsl 804538 


noncoding 






19 


44 


TOP2A 


rs 1804537 


noncoding 






19 


44 


TOP2A 


rsl 141364 


coding-synon 


A A A 1 A Ai*l 

AAA|AALr 


JV|J>w 


23 


48 


KRT10 


rsl 2231 


noncoding 






23 


48 


KRT10 


rsl 132259 


coding-nonsynon 


C A 1 |LAj 1 


TTlp 

xi|K 


23 


48 


KRT10 


rsl 132257 


coding-synon 


crru|i"iu 


T IT 


23 


AC 

4c 


JvJvl 1U 


rsi i jz.zj>o 


cnrlin or— cvn on 


GCC|GCT 


A|A 


| 23 


48 


KR.T10 


rsl 132255 


coding-synon 


CTG|TTG 


L|L " 


23 


48 


KRT10 


rsl 132254 


coding-synon 


GGC|GGT 


G|G 


23 


48 


KRT10 


rsl 132252 


coding-synon 


TTCjTTT 


| F|F 


23 


48 


KRT10 


rsl 132268 


coding-nonsynon 


CAGjGAG 


Q|E 


23 


48 


KRT10 


rsl 132258 


coding-nonsynon 


CGG|TGG 


R|W 
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SEQUENCE 


M 5 ' CCATATATAAAACCACTGTCCTGTCCTTTGTGGCT 


CCCCCATCTGTCTGTCTATATTTGTC 


TGCCTACGCTGACGACTATGTG 


M 5 ' TTTGGTTTTCTACAACTGTTGCTAT 


GGGCTCCACACACCAGATG 


ACGCTCTGAGCACCCTCTACA 


VI 5 ' TGTCACAGGGACTGAAAACCTCTCCTCATGT 


CCCAAGGCCACGAGCTT 


TGTTGCTCTCTTAACGAATCGAAA 


VI 5' CTGGTCAAACAAACTCTCTGAACCCCTCC 


TGGTGAGGAAAAGCGGACAT 


CTGGCTTGGAGGACAGTGAAG 


VI 5 ' CCAAGCCCTCCCCATCCCATGTAT 


GAGGTGTCGTACCGCGTTCTA 


CCGTTCTGCTCTTCCCTGTCT 


VI 5 ' CCAGACCCGCTTCACTGACCTGC 


CGCCTGTACTTCAGCATGGA 


GCGGTTCAGCTGGTGGAA 


5' ACCCCGAGGCATCACCACAAATCAT 


s 

O 
H 
O 
Eh 
O 

I 


TAGGCTCAGAGTCAGACCCAAAC 


tf5' CCCTCGTGGGCTTGTGCTCGG 


AAGCCGCCAGTTCATCTTTTT 


CTTGTGGTTCAAGTCAAATGTTCAG 


VI 5 ' TCTGCCTGCGCTCTCGTCGGT 


GGGCTGGGCACCTGACTT 






in 
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in 
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PRIMER 


CACNBl 


CACNBIFOR 


CACNBIREV 


CDC6 


CDC6FOR 


CDC6REV 


lEBIl-1 


EBIl-l FOR 


EBIl-l REV 


EBIl-2 


EBI1-2F0R 


EBI1-2REV 


GCSF 


GCSFFOR 


GCSFREV 


GRB7 


GRB7 FOR 


GRB7REV 


HKA3 


HKA3 FOR 


HKA3REV 


MLN50 


MLN50 FOR 


MLN50 REV 


MLN64-1 


MLN64-1 FOR 
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SEQUENCE 


5' GCAAGAAATCAGCCATAAGAAAGC 3' 


5' TTGCAGCTCCTCTGAGTAAAACAT 3' 
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reverse 


TGCCGTGCCAGAGAGA 


GCCCAGCCTGTCACTTATTC 


CAGCATTGGATGCAATCC 


AGGACAGTGTGTAGCCCTTC 


GCTGGCCATATATATATTTAAACC 


TGCCTACTGGAAACCAGA 


NGGAGGTTGCAGTGAGCCAAGAT 


I 

U 
O 

i 


GTCTGGGTCTTTATGGNGCTTGTG 


GCATACAGCACCCTCTACCT 


CAGGAGTGAGACACTCTCCATG 


GCGTGTCTGTCTCCATGTGTGC 


CTGGAGGTTGGCTTGTGGAT 


CCACCCAGAAAAACAGGAGA 


TTGTCACCCCATTGCCTTTC 


CTTGTCCCTTCTCAATCCTCC 


GATTACAGTGCTCCCTCTCCC 


GATTACAGTGCTCCCTCTCCC 


TTTAGACTTGGTAACTGCCG 


GCCATGATCTCCCAAAGCC 


AAACTGTGTGTGTCAAAGGATACT 


TACATGAAGGCATGGTCTG 


GCTGCGGACCAGACAGAT 


CCCACGGCTTTCTTGATCTA 


CACTCAACTCAACAGTCTAAAGGTG 


ATTCAGCCTCAGTTCACTGCTTC 


TAGTCTCTGGGACACCCAGA 


ACTGACTGCGCCACTGC 


GAACAGGGTTAGTCCATTCG 


fonvard 


ACAGTCTATCAAGCAGAAAAATCCT 


GACAACAGAGCGAGACTCCC 


TGGTCATTCGACAACGAA 


CTCCAGAATCCAGACCATGA 


GGAAGAATCAAATAGACAAT 


CATAGGTATGTTCAGAAATGTGA 


o 

i 

o 
o 

1 

I 


CAAAAGCTTATGATGCTCAAACC 


TAGGTTCACCTCTCATTTTCTTCAG 


CGGACCAGAGTG1TCCATGG 


AGGGGAGAATAAATAAAATCTGTGG 


TGGATTCACTGACTCAGCCTGC 


TCCCCAATGACGGTGATG 


GGTCCCACGAATTTGCTG 


TCGATCTCCTGACCTTGTGA 


CCTTGGATAGATTCAGCTCCC 


o 

B 

! 
i 


§ 

[ 

o 
o 


AGTTTGACACTGAGGCTTTG 


TGCAGATGCCTAAGAACTTTTCAG 


TCGAGGTTATGGTGAGCC 


GCTGATCTGAAGCCAATGA 


CTAATATAATCCTGGGCACATGG 


GATAAAAACAAGCACTGGCTCC 


TGTAATGTAAGCCCCATGAGG 


GTGAGTTCAAGCATAGTAATTATCC 


GATCCAGTGGAGACTCAGAG 


ATTCCTGAGTGTCrACCCTGTTGAG 


u 

| 

i 

§ 


e 


D17S946 


D17S1181 


D17S2026 


D17S838 . 


D17S250 


D17S1818 


D17S614 


D17S2019 


D17S608 


D17S1655 


D17S2147 


D17S754 


D17S1814 


D17S2007 


D17S1246 


D17S1979 


D17S1984 


D17S1984 


D17S1867 


D17S1788 


D17S1836 


D17S1787 


D17S1660 


D17S2154 


D17S1955 


D17S2098 


D17S518 


D17S1851 


D11S4358 


No. 


r— < 




CO 




»o 


VO 






OS 






CN 


CO 




m 


VO 


r- 


OO 


OS 


O 
CN 


C4 


CN 
CN 


CO 
CN 




to 

CN 


vo 

CN 


r- 


OO 
CN 


OS 
CN 



BHC 03 1 051 



a 
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L32949 
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178 - 200 




VO 
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CN 


r- 




CM 
CM 
i — < 


102-120 


reverse 


AGTCAGCTGAGATTGTGCC 


CCCCACACACAGCTCATATG 


GCAACAGAGGGAGACTCCAA 


TATGGAGTACCTACTCTATGCCAGG 


AGCTGTGACAAATGCCTGTA 


AAGATGGGGCAGGAATGG 1 


AAACACCACACTCTCCCCTG 


GGGGCAGACGACTTCTCCTT 


ATTCACCTAATGAGGATTCTTCTTT 


TCCGCATCCTTTTTAAGAGGCAC 


TGACGTGCTATTTCCTGTTTTGTCT 


CAACACACTACCCCAGGA 


GAGTCCGCTACCTGAGTGCT 


forward 


GTTCTTTCCTCTTGTGGGG 


CAAGCCAAGACATCCCAGTT 


TTTTCTCTCTCATTCCATTGGG 


TCCCATCCCGTAAGACCTC 


ATTCAAAGCTGGATCCCTTT 


TCACCTGAGATTGGGAGACC 


TCACTGTCCTCCAAGCCAG 


TTCTTGGGCTTCCCGTAGCC 


GGGGATACAACCTTTAAAGTTCC 


GCTGAAATAGCCATCTTGAGCTAC 


CTTTCACTCTTTCAGCTGAAGAGG 


GTTTGTTGCTATGCCTGC 


1 

i 


a 


D17S964 


D19S1091 


ON 
GO 

s 


D10S2160 


D17S1230 


D17S1338 


D17S2011 


D17S1237 


D17S2038 


D17S2091 


D17S649 


! D17S1190 


M87506 


No. 


o 

ro 


CO 


CN 

CO 


CO 
CO 


CO 


in 

CO 


VO 
CO 


CO 


OO 
CO 


o\ 

CO 




5r 


Or 
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CLAIMS U Oki 2B03 

1. A method for the prediction of response to cancer treatment, by the detection of at least 
markers characterized in that the markers are genes and fragments thereof or genomi 
nucleic acid sequences that are located on one chromosomal region which is altered ii 
malignant neoplasia. 

2. The method of claim 1 wherein the treatment is an antibody treatment, antihormona 
treatment, anti-growth factor treatment, taxol based treatment, anthracyclin base< 
treatment and platinum salt based treatment. 

3. The method of claim 1 wherein the treatment includes Herceptin™, trastuzumab or 20 
antibodies. 

4. The method of claim 1 characterized in that the markers are: 

a) genes that are located on one or more chromosomal region(s) which is/are altered 
in malignant neoplasia; and 

b) 

i) receptor and ligand; or 

ii) members of the same signal transduction pathway; or 

iii) members of synergistic signal transduction pathways; or 

iv) members of antagonistic signal transduction pathways; or 

v) transcription factor and transcription factor binding site; or 

vi) integral parts of heteromeric complexes 

5. The method of claim 1 or 2 wherein the malignant neoplasia is breast cancer, ovarian 
cancer, gastric cancer, colon cancer, esophageal cancer, mesenchymal cancer, bladder 
cancer or non-small cell lung cancer. 

6. The method of any of claims 1 to 5 wherein at least one chromosomal region is defined as 
the cytogenetic region: lpl3, lq32, 3p21-p24, 5pl3-pl4, 8q23-q24, llql3, 12ql3,17ql2- 
q24, 17ql 1.2-21.3 or 20ql3. 



A method for the prediction, diagnosis or prognosis of malignant neoplasia by the 
detection of at least one marker characterized in that the marker is selected from: 
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a) a polynucleotide or polynucleotide analog comprising at least one of the sequences 
of SEQ ID NO: 319 to 389; 

b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) and encodes a polypeptide 

5 exhibiting the same biological function as specified for the respective sequence in 

Table 2 or 3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (c) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 

10 respective sequence in Table 2 or 3 

d) a polynucleotide or polynucleotide analog which represents a specific fragme^^ 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (d) 

e) a purified polypeptide encoded by a polynucleotide or polynucleotide analog 
sequence specified in (a) to (e) 

15 f) e purified polypeptide comprising at least one of the sequences of SEQ ID NO: 

397-467; 

Are detected. 

8. The method according to any of claims 1 to 6 wherein the markers are selected from: 

a) a polynucleotide or polynucleotide analog comprising at least one of the sequen||^ 
20 of SEQ ID NO: 2 to 6, 8, 9, 11 to 16, 18, 19, 21 to 26, 53 to 76 or 315 to 389 



b) a polynucleotide or polynucleotide analog which hybridizes under stringent 
conditions to a polynucleotide specified in (a) and encodes a polypeptide 
exhibiting the same biological function as specified for the respective sequence in 

25 Table 2 or 3 

c) a polynucleotide or polynucleotide analog the sequence of which deviates from the 
polynucleotide specified in (a) and (b) due to the generation of the genetic code 
encoding a polypeptide exhibiting the same biological function as specified for the 
respective sequence in Table 2 or 3 
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d) a polynucleotide or polynucleotide analog which represents a specific fragmer 
derivative or allelic variation of a polynucleotide sequence specified in (a) to (c) 

e) a purified polypeptide encoded by a polynucleotide sequence or polynucleotic 
analog specified in (a) to (d) 

0 A purified polypeptide comprising at least one of the sequences of SEQ ID NO: 2 
to 52 or 76 to 98 or 393 to 467 

are detected. 



A diagnostic kit for conducting the method of claims 1 to 8. 
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METHODS AND COMPOSITIONS FOR THE PREDICTION. DIAGNOSIS. PROGNOSIS. 
PREVENTION AND TREATMENT OF MALIGNANT NEOPLASIA 

ABSTRACT OF THE DISCLOSURE 

The invention provides novel compositions, methods and uses, for the prediction., diagnosis, 
prognosis, prevention and treatment of malignant neoplasia and breast cancer in particular. Genes 
that are differentially expressed in breast tissue of breast cancer patients versus those of normal 
people are disclosed. 
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SEQUENCE LISTING 28. Qfa ^flgg 

<110> Bayer AG 



<120> METHODS AND COMPOSITIONS FOR THE PREDICTION, DIAGNOSIS, PROGNOSIS 
PREVENTION AND TREATMENT OF MALIGNANT NEOPLASIA N ° SIS ' 



<130> BHC031051 



<160> 512 



<170> Patentln version 3.1 

<210> 1 

<211> 3846 

<212> DNA 

<213> Homo sapiens 



<400> 1 

gcctcccgcc 

gcttttctcg 

gagaaggtga 

aagatgacac 

cactacccca 

caacagagtg 

aaaggtttca 

cagatcagta 

ggcgagggca 

ctggagcagc 

cage age egg 

cagcgcagcg 

gecgacgagg 

gaegaegget 

aactacgtgg 

catcgcatcc 

gegacagett 

gccacttctg 

tctctggatg 

ttggcctgtg 

ctcacatacc 

ggctccctac 

tctgccgctg 

ctgcccacct 

gttcctgctg 

gccctcctgc 

tcaggttctt 

ctcttccgca 

ctaggctgga 

tcagcctccc 



agctcgcctc 

gaaccatgaa 

actgtctgga 

tgaacatgaa 

agcagtcctt 

agctccagag 

gcgtagtggc 

atataaaata 

tggagecaga 

agcagcctca 

tggcccagtc 

ccccaggtgg 

acgaggtctc 

ggatgtacgg 

aggecatctg 

gtcctgggcg 

gtgattccta 

cgggctccct 

gaaegggtat 

ggcctcacct 

cacacattcc 

attccccaga 

tgtagggacc 

ggggtctctc 
tggggcgaat 
ecacgcagac 
gectttctta 
gcccctttcc 
agccatggtc 
tcagccccca 



ggggaacagg 

ccccaactgc 

taagttctgg 

gaactacaag 

caccatggtg 

teaggtgege 

agacacgccc 

ccatgaggag 

gegtegggat 

ccacatcccg 

ctatggtggc 

tggcgggaag 

cttccaggac 

gacggtggag 

aacccggagc 

tgagccgtcc 

cccctcttcc 

cctctggcag 

gggcctctct 

gcccctctgt 

agggctgggg 

gtgggatcca 

agtgggatgg 

tccctacctc 

tcatcccctc 

ctgcagcggg 

attttaggga 

ccacgcccac 

ccgaagtgta 

ggatctgggt 



acgcgcgtga 
gcccggtgcg 
cataaagcat 
ggctacgaga 
gcggacaccc 
tacaaggagg 
gagctccaga 
tttgagaaga 
tcacaggacg 
accagtgccc 
tacaaggagc 
cggtaccgcg 
ggggacacca 
cgcaccggcg 
gcccccatct 
attcttcagt 
agcttctttt 
gcttcccccg 

sggggaggca 

tctctcccct 
tgagectgae 
cttcttggtt 
gctctacctc 
cctccfccagg 
cccgcgcgtt 
caaagagctc 
cagctaccgg 
ccccagtctc 
gggcaagggt 
taggtggccg 



getcaggegt 

geaagategt 

gcttccattg 

agaageccta 

eggaaaaect 

agtttgagaa 

gaatcaagaa 

gccgcatggg 

gcagcagcta 

eggtttacca 

ctgcagcccc 

cggtgtatga 

tegtcaaegt 

acaeggggat 

gtcttcagca 

gtctctgttt 

gecaactgaa 

tgatcgactt 

gggctggaat 

cacatcctcc 

tgccaggacc 

cctgggatgg 

tctttctcaa 

ggcaacaaea 

ccttcgcaca 

ccgaggaagc 

aaggagggga 

cagggaccct 

gcctcaggac 

ctcctccctg 



ccccgcccca 

gtatcccacg 

cgagacctgc 

ctgcaacgca 

tcgcctcaag 

gaaeaagggc 

gacccaggac 

ccctagcggg 

ccggcggccc 

gcagccccag 

agtctccata 

ctacagcgcc 

gcagcagatc 

gctgccggcc 

cattccacgg 

tttaaaacct 

gccttcttct 

cttggttttc 

gggagacctg 

tgcccagctc 

ccaggtcagg 

cgatggggac 

agagggggct 

ggagaatggg 

ctgtgatttt 

acagcttggg 

acaaggagtt 

tgcctgcctc 

cttttggtct 

ctcctcatgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



BHC03 1 051 



gaagatgtct cagagccttc catgacctcc 
agctgcacaa agtcagcagg gaccactaaa 
catgtatgtc tgcaggtgtc tgacacgcaa 
gggggtgtgt ctgtaggtgt ctctgggcct 
agagctgtct tcccctgaga gtttcctcag 
gcagagaagt tccttaggtt ttctttggaa 
ggaggaagcc caccaatctg ccctttgcag 
tggagttggg gctgccatag ggtctgcagc 
tgtcactcca ggcatatgtt tccccatctc 
gtgtcaccct gtgggtgtct ccctcggggg 
taaagctccc ttgaagcaag aaagagggtc 
ggggatgggg agggaaagac ggtgctatat 
gtgacaaccc tggcctcact tgattcatct 
catcccattt tcatcctgag cccaaccagg 
acttgtaccc acaggtgagg ggcaggacct 
tcatgggtgt ttttgtcaac tgcttgttaa 
ggttgaggaa aagactgtgg gtggggaggc 
ccccagccta ggtggaggca agtggaatat 
ggcagagaat ctcttgggag tcttgggtgg 
aaagcacaat gtggatttgg ggaccaaagg 
tttgggagag tggtgagtgg aagggaggag 
taattctcct tcccagataa ggcaagccag 
actcttcctg tcctaaaaat aggggccgtt 
tgtcacactg gtgctgagtg accgggggct 
catccctaga agagcacaga gccctgaggg 
ttctctacag ttcacagagg tctttcagct 
tagaatgtga atataacttt tgtgggccaa 
ggaaagtgcg ttctcccagc actgcctcct 
aatgacttta ttgcttaafct tctgcctttc 
tttatagctg gaaaaaacaa aataccaccc 
tgaccacgtg aaatttgcct ctgtccaaac 
agtgtgtgaa gccgccagtt catcttttta 
ttggtcccct ccctcgtggg cttgtgctcg 
tgaacatttg acttgaacca caagtgaatc 
ttttta 
<210> 2 

<211> 1711 

<212> DNA 

<213> Homo sapiens 
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cctccccagc ccaatgccaa gtggacttgg 
tctccaagac ctggtgtgcg gaggcaggag 
gtgtgtgagt gtgagtgtga gagatggggc 
gtgtgtgggt ggggttatgt gagggtatga 
aacccacagt gagaggggag ggctcctggg 
tgaaattcct ccttcccccc atctctgagt 
tgtgtcaggg tggaaggtaa gaggttggtg 
ctgctggggc taagcggtgg aggaaggctc 
tgtctggggc tacagaatag ggtggcagaa 
ctcttcccct agacctcccc ctcacttaca 
ccagggctgc aaaactggaa gcacagcctc 
ccagttcctg ctctctgctc atgggtggct 
ctggttttct tgccaccctc tgggagtccc 
ccctgccatt ggcctcttgt cccttggcac 
gaaggtattg gcctgttcaa caatcagtca 
ttgatttggg gatgtttgcc ccgaatgaga 
cctgcctgac ccatcccttt tcctttctgg 
cttatattgg gcgatttggg ggctcgggga 
cgctggtgca ttctgtttcc tcttgatctc 
tcagggacac atccccttag aggacctgag 
cagcaagaag cagcctgttt tcactcagct 
tcatggaatc ttgctgcagg ccctccctct 
ttcttacaca cccccagaga gaggagggac 
gctgggcgtc tgttctttac caaaaccatc 
gctgggctgg gctgggctga gcccctggtc 
catttaatcc caggaaagag gcatcaaagc 
tactaagaat aacaagaagc ccagtggtga 
gttttctccc tctcatgtcc ctccagggaa 
ccccctcaca catgcacttt tgggcctttt 
tacaaacctg tatttaaaaa gaaacagaaa 
atttcatccg tgtgtatgtg tatgtgtgtg 
tatggggttg ttgtctcatt ttggtctgtt 
ggatcaaacc tttctggcct gttatgattc 
tttctcctgg tgactcaaat aaaagtataa 



<400> 2 

gagggaaggc aggaaggagg 



gccggcggcg ggaggggagg 
cttacccacc ctcccaggag 
acagcaagag gaaagggcga 
ccaacagctt tgtccgccag 
atgtatctct ggaggaggac 
cgcagctcga gaaggccaag 
acaatccgtc tccaggggat 
aagacttcct gcacatcaag 
aggagggctg tgaggttggc 
tgcaggaaca gaagctgcgc 
gttccagtct gggagatgtg 
aacagaagca gaagtcgaca 
ccatcatcct ggtgggaccg 
ctttatttga cttcttgaag 
cagatatttc cctggctaag 
agcgctccaa cacacgctcc 
agctggcccg gacccttcag 
agctgtccaa gacctcgctg 



cagccgaagg ccgagctggg 
cccacggcct ctcccatgcg 
ctcctctcca tggtccagaa 
atccccatgg aggtcttcga 
ttcaaacggt cagatgggag 
ggctcagcgg agtcctacac 
cgggaagcct taaggaagga 
accaagccag tggcatttgc 
gaggtgcctg tgcagggagt 
gagaaataca ataatgactg 
ttcattccca gccccgtcaa 
cagaaccgcc tcggctccag 
gtgactggca cccgccgccc 
gagcatgtgc ccccctatga 
tcgctcaagg gctacgaggt 
catcggtttg atggcaggat 
cgctcagttc tcaacaaccc 
agcctggctg aggtgcagag 
ttggtcgctc tggatgctga 
gcccccatca ttgtttacat 



tggctggacc gggtgctggc 
ctgagggagc ccggctgcgg 
gaccagcatg tcccggggcc 
ccccagcccg cagggcaaat 
cacgtcctcg gataccacat 
cagccgtcca tcagactctg 
agcagagcgc caggcattag 
tgtgcggaca aatgttggct 
ggccatcacc ttcgagccca 
gtggatcggg cggctggtga 
actggacagc cttcgcctgc 
caaatcaggc gataactcca 
cacaccccct gccagtgcca 
cgtggtgcct tccatgaggc 
tacagacatg atgcagaaag 
ctccatcact cgtgtgacgg 
cagcaaacac atcatcattg 
tgaaatcgag cgaatcttcg 
caccatcaat cacccagccc 
caagatcacc tctcccaagg 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120. 

31801 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3846 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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tacttcaaag 
tagcggcctc 
agaaccaatt 
ccacacaccc 
cagccctgcg 
tcaggagaaa 
cccaggagca 
gtcggaactg 
ataagatgag 
<210> 3 



gctcatcaag 
ggaaaagctg 
ggaggatgcc 
gcccagcagc 
ccgtagccct 
cctcggcttc 
ggaacatgcc 
gagtgcaggg 
cggcaaaaaa 



tcccgaggaa 
gcacagtgcc 
tgcgagcatc 
acgccaccca 
gcccctgtct 
tggggcgggc 
atgtagtggg 
aacatggagg 
aaaaaaaaaa 



agtctcagtc 
cccctgaaat 
tggcggagta 
atccgctgct 
ccaacctcca 
tggagtcctc 
cgccctgccc 
aggaagggaa 



caaacacctc 
gtttgacatc 
cttggaagcc 
gaaccgcacc 
ggtacaggtg 
acagcggggc 
gtcttccctc 
gagctttatt 



aatgtccaaa 
atcctggatg 
tattggaagg 
atggctaccg 
ctcacctcgc 
agtgtggtgc 
ctgctctggg 
ttgtaaaaaa 



<211> 698 
<212> DNA 
<213> Homo sapiens 



<400> 3 

ttttcctttc 

ctctagtgtc 

aatcgccaat 

ccgcaagcct 

gaagggcagg 

gaaggtcaca 

atctaagaag 

gttcaaaaac 

caagaagctc 

gcgccgtgaa 

ggaagagacc 

atagatcagc 

<210> 4 



gctgctgcgg 
ctccgctgtg 
gccaactccc 
gtgacggtcc 
cacatgggca 
tggatgagga 
atcgatcgcc 
aagcggattc 
ctggctgacc 
gagcgcctcc 
aagaaataaa 
cattaaaata 



ccgcagccat 
gcaagaagaa 
gtcagcagat 
attcccgggc 
taggtaagcg 
gaatgaggat 
acatgtatca 
tcatggaaca 
aggctgaggc 
aggccaagaa 
acctcccact 
aaacaagcct 



gagtatgctc 
ggtctggtta 
ccggaagctc 
tcgatgccgg 
gaagggtaca 
tttgcgccgg 
cagcctgtac 
catccacaag 
ccgcaggtct 
ggaggagatc 
ttgtctgtac 
taatctgc 



aggcttcaga 
gaccccaatg 
atcaaagatg 
aaaaacacct 
gccaatgccc 
ctgctcagaa 
ctgaaggtga 
ctgaaggcag 
aagaccaagg 
atcaagactt 
atactggcct 



agaggctcgc 
agaccaatga 
ggctgatcat 
tggcccgccg 
gaatgccaga 
gataccgtga 
aggggaatgt 
acaaggcccg 
aagcacgcaa 
tatccaagga 
ctgtgattac 



<211> 5810 

<212> DNA 

<213> Homo sapiens 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1711 



60 
120 
180 
24 0 
3 00 
360 
420 
480 
540 
600 
660 
698 



<400> 4 

gggaagatgg 

tccttcccgg 

gtttcctccg 

gccttcagga 

ttctctcctg 

taagcttgtg 

tttggtcagc 

gactgatcgt 

tggcactgaa 

aggacagctt 

gcttgtacag 

tgttaatctg 

ccaatcctta 

tggtcccttg 

tcatttaatg 

atctcccatc 

gacaattgaa 

acatccagtt 

tgatcttcct 

tgttcagaaa 

acccctgtat 

gaatcacaac 

caaggatgct 



cggcggcctc 

gaagagtagg 

ttagtggtgg 

tgaaagctca 

gaacggctcc 

cgtcaagtca 

tgtttggaga 

ttggagtcca 

tgttacatca 

tgtgatgtaa 

cagctaaggg 

tataaccttc 

gaacaagatc 

gataagattc 

aacctgaagt 

attttgcatg 

ggaacatctg 

gacaataaat 

gcctgtttct 

ctgcagaact 

gaactgatca 

atgagatttt 

cctcttccag 



gagcaccctc 
gactgctggt 
ggaagggctt 

ggggggaaac 

atgcaaaatt 

tggagaagag 

cattgcagaa 

tagcaggaca 

cgtcagatat 

aagtggctca 

aaaaaaattc 

caggggacaa 

tttctaaaat 

ttcatggaag 

actatgtctc 

agaataatgt 

ctgtgtacaa 

ggaccccttc 

tcttgaaatt 

gcacaggaat 

ctcagtttga 

atgctgctct 

atggccgaag 



ctcttcttgc 

gccctgcgtc 

atccttttgt 

cgaggagtca 

taaccaaaat 

ggttgtgatg 

ggctctcaaa 

gaatggactg 

gttctatgtg 

ccatggggag 

tgatgaattt 

caaactgaag 

ggcaattatg 

tgttggctat 

tccttctgac 

ttctcgatct 

actcccaatt 

cttctcctca 

tccccagcca 

tccattgttt 

gctatcaaag 

tcctggtcag 

tctacaggga 



cgccggggac 

ccgggatccc 

ggcggatcta 

gaaaagctga 

agaccctgga 

agttctggag 

gtaacatctt 

ggctctcatc 

gaagtgcagt 

aatcctgtga 

tctaagcacc 

actaaaatgt 

tactggaaag 

ctcacaccaa 

ctactggatg 

ttgggcatga 

gcaccattaa 

atcaccagtg 

atcccagtat 

gaaactcaac 

gaccctgacc 

cagcactgct 

acccttgtta 



ttcagattga 60 

gagccaactt 120 

gcttctcctc 180 

gtaagatgag 240 

gtgaaaccat 300 

ggcatcaaca 360 

taccagcaat 420 

tcagtgccag 480 

tagatcctgc 540 

gctgtccgga 600 

ttaagggcct 660 

acttggctct 720 

caactaatgc 780 

ggagtggggg 840 

acaagactgc 900 

atgcatcagt 960 

ttatggggtc 1020 

ccaacagtgt 1080 

ctagagcatt 114 0 

caacttatgc 1200 

ccataccttt 1260 

atttcctcaa 132 0 

gcaaaatcac 1380 
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ctttcagcac 
taacaccctc 
tctccaattt 
tgtgaatgac 
taaactctac 
tgttcaaaga 
cattcaagcc 
aaagaacctg 
gagtggtacc 
tatgagcatg 
ggtgtctcag 
cattggctcg 
caacaccaag 
ctcaaccctt 
catggaaata 
tgagaaacca 
tgactcacag 
catcactcca 
tcacccccaa 
agatgtaact 
tgatgattgc 
gagtaccctg 
tccagctgat 
tttttttcat 
ttttggagaa 
atttgcatct 
gaccaagttt 
cggcaaagct 
actgaccact 
caccagtaat 
gaccccttct 
tgagggaaag 
aggtggatct 
accacccatt 
ttcctctcac 
ccatagccat 
taaatcagaa 
ttctggatct 
cataaccaag 
gccatcatca 
gccacctgga 
atcctctaaa 
tagttcaagc 
gaaaactccc 
ctcttccatg 
aaacaagaag 
tggccctggg 
ccatccfcatg 
aagtgataaa 
gacctcagag 
tgatggaggc 
tggagaaggg 
tggttccact 
cccccagaat 
gaatagtccc 
taagaagcac 
caaagaccga 
catctcttca 
ctcaaggctc 
cctgattggg 
attgataagt 
agaatcctaa 
ccctattaaa 
gccagaaaga 
agagggaagg 
ttccattttt 



cctggccgag 
attggaagct 
gaagtgtgtc 
tccctggtgt 
aaagggctgt 
tgtatgtcca 
gac.accccag 
cccccggcta 
actacatcaa 
agcatcaaag 
aacccaattc 
agtccgaccc 
aaccacccga 
tatggaagca 
tgctcgggga 
aagcaccaga 
aaccctatct 
gctccaagcc 
cccagtattc 
gacatccttt 
ccagccattg 
tttgactctg 
cttattgcag 
gatggagtag 
gaatattttg 
caggcactaa 
aagggcaata 
ttagctcctg 
ggggacttag 
agtactctct 
aatgatggga 
tctccatctc 
aaatcgccag 
cccaaaatca 
agtcagtata 
tcttcctcct 
ggttcatcaa 
agccagfccca 
catggactga 
cttatgaatc 
ggctctgaca 
gccaagtccc 
tctggcatga 
ccatcatcta 
tcatcctctc 
ccgtccttga 
ggtgaagacc 
tcctccaaac 
gacaaatcaa 
tcaaaaaatg 
tcccctagca 
cttaggcctc 
ccaaagcatg 
ctggacagtg 
agctcagacg 
aaaaaggaaa 
gacaagaaaa 
gaccagtcct 
agcccagact 
aattaggaac 
ttataggcaa 
atggcatggc 
gaaaccacag 
aagttaaaat 
gagggaaaca 
aggccatgtt 



ttcctcttat 
gtgtcaaaag 
ctctctcaga 
gtgtggtaat 
cggatgcact 
tccctgtgac 
cactgtccct 
gcagcccagg 
ccaacacctt 
atcggcatga 
ttaccagttt 
ctcctcatca 
tgctcatgaa 
gccctttaga 
gcaacaagac 
ctgaagatga 
ttgatgtcaa 
agtgtagcac 
aaaggatggt 
cagacattgc 
gcacccctct 
atgtctttca 
atgctgctgg 
atttcaatcc 
atgaaagcag 
atactttggg 
accaagccga 
cagatcttat 
ggaaagaaaa 
cggggcccgg 
aaagcaaaga 
atagttcttc 
gcagtgcagg 
ctattcagat 
ccagcagtgg 
cttcctcatc 
gttccaagtt 
aaaattcatc 
gcagtggctc 
cttctttaag 
agcttgcctc 
ctatcagttc 
agtcatcttc 
attcctgtac 
agaaccagca 
cagctgtcat 
cactggacgg 
ataacatgtc 
aggtttccac 
tggggagcac 
ttaaagccaa 
aaatggcttc 
agcgtggctc 
aaagtgagtc 
atggtatccg 
agaagaaagt 
aatctcatag 
tgtctatgac 
ttatgattgg 
cttatttcct 
accaccataa 
tttgacatca 
ggtttgattc 
acttgcttaa 
gttttgtggg 
ttaaactcat 



cctaaatctg 
aactattctg 
gtctcgtttc 
ggatgtgcag 
gatctgcaca 
gatgagggct 
cattgcagag 
gtatggcatg 
tccggggggt 
gtcggtgggc 
gttgcaaatc 
cacgccgcca 
ccttctcaaa 
aaggcagaac 
caagaaaaag 
ctrttcagagg 
catgacagct 
tcccccaaca 
ccgactatcc 
agaagaagct 
tcgagattct 
aactaacaat 
aagccccagt 
tgatttattg 
ccaaagtggg 
ggtgccaatg 
cacagttgat 
ggagcatcac 
gactcaaaag 
attagacagc 
taagcctcca 
taacagacct 
aagatctcag 
tcctaaggga 
ttctgtgtct 
tgcttccacc 
aagtagcagt 
ccagtctggg 
tagcagcacc 
taaaccaaac 
tccaatgaag 
aggttctggt 
agggttagga 
ggcatcttcc 
tgggagttct 
agataaactg 
ccagatgggg 
aggaggagag 
ctccgggagt 
aggtgtggca 
agtgactttg 
ttctaaaaac 
tcccagccat 
aggctcctcc 
accacttcca 
aaaagacaaa 
catcaagcca 
aagtaacaca 
ggaggaagat 
aaaagaaaca 
ggggtgagtc 
agctgggtga 
tggttaccag 
gaaagggagg 
aaatattcat 
tttagtgcat 



atcagacacc 
aaagaagatt 
agcgtatctt 
ggcttaacac 
gatgacttca 
attcggagga 
acagttgaag 
accacaggca 
cccattgcca 
catggggagg 
acagggaacg 
cctgtctctt 
gataatcctg 
tcctcttccg 
aagtcatcaa 
gagctatttt 
gacacgctgg 
acttacccac 
agttcagaca 
tctaaacttc 
tcaagctctg 
aatgaaaatc 
agtgactctc 
aacagccaga 
gataatgatg 
cttggaggtg 
ttcagtatta 
agtggtagtc 
agggtaaagg 
aaaccaggga 
aagcggaaga 
tttaccccac 
actcccccag 
acagtgatgg 
tcctcaggca 
tcagggaaga 
atgtattcta 
gggaagccag 
aagatgaaac 
atatcccctt 
cctgttcctg 
ggttctcata 
tcctcaggct 
tcctcctttt 
aaaggaaaat 
aagcatgggg 
gtgagcacaa 
tttcagggca 
tcagtggatt 
aaaattatca 
cagaaacctg 
tatggctctc 
agtaagtcac 
atagcagaga 
gaatacagca 
gatagggacc 
gagagttggt 
atcttatctg 
gatgatctta 
gggccagagg 
agacaggtct 
attagaaagg 
gaagtcttct 
ggggtgggag 
atatattttc 
gtatatgaag 



aagtggccta 
ctcctgggct 
ttcagcaccc 
atgtgagctg 
ttgccaaagt 
aagctgaaac 
acatggtgaa 
acaacccaat 
ccttgtttaa 
acttcagcaa 
gggggtctac 
cgatggccgg 
cccaggattt 
gctcaccccg 
gattaccacc 
caatggatgt 
atacgccaca 
aaccagtacc 
gcattggccc 
ccagcactag 
ggcattctca 
catacactga 
ctaccaatca 
gccaaagtgg 
atttcaaagg 
ataatgggga 
tttcagtagc 
agggtccttt 
aaggcaatgg 
agcgcagtcg 
aggcagacac 
ctaccagtac 
gtgttgccac 
tgggcaagcc 
gcaaaagcca 
tgaaaagcag 
gccaggggtc 
gctcctctcc 
ctcaaggaaa 
ctcattcaag 
gaactcctcc 
tgtctggaac 
cgttgtccca 
cctcaagtgg 
ctcccagcag 
ttgtcaccag 
attcttccag 
agcgtgagaa 
cttctaagaa 
tcagtaagca 
gggaaagtag 
cactcatcag 
cagcatatac 
aatcttatca 
cagagaaaca 
gagaccggga 
ccaaatcacc 
cagacagacc 
tggatgtggc 
aaaaaaaact 
gatttggtta 
catatccaga 
ttgttcctgt 
gggtgtaggg 
ttctcccttt 
ggctgggcag 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
276(A 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
40 
414 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



IP 
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5- 



aaaatgaaaa 
aggtagttgt 
gtaatgtttg 
tatttgtaat 
atgaggttta 
agcttatagc 
tttagagtta 
agggttcaga 
<210> 5 



agcaatacat 
ccgtttgagt 
gtgtctgttt 
attttaaccc 
gggttcaaaa 
ttgtgctaat 
aatataatag 
gatcatagga 



tccttgatgc 
catgggcaaa 
cttaggagtg 
tttatctgtt 
tgatggaagg 
actgtagcat 
aaaaaccaaa 
atattaggag 



atttgcatga 
tgaaggactt 
actgggggag 
tgtttttata 
ccgaagagca 
caagcccaag 
atgatatttt 
ttacctctct 



<211> 5515 

<212> DNA 

<213> Homo sapiens 



aggttgttca 
tggtcatttt 
ggaagattat 
cagtgtttcg 
aggcttatat 
caaattagtc 
tattttagga 
gtggaggtat 



actttgtttg 
ggacacttaa 
tttagctatt 
ttctaaatct 
99tggtaggg 
agagcccgcc 
gggtttaaat 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5810 



<400> 5 

cttttttccc ttcttcaggt caggggaaag ggaatgccca attcagagag acatgggggc 60 

aagaaggacg ggagtggagg agcttctgga actttgcagc cgtcatcggg aggcggcaqc 120 

tctaacagca gagagcgtca ccgcttggta tcgaagcaca agcggcataa gtccaaacac 180 

tccaaagaca tggggttggt gacccccgaa gcagcatccc tgggcacagt tatcaaacct 240 

ttggtggagt atgatgatat cagctctgat tccgacacct tctccgatga catggccttc 300 

aaactagacc gaagggagaa cgacgaacgt cgtggatcag atcggagcga ccgcctgcac 360 

aaacatcgtc accaccagca caggcgttcc cgggacttac taaaagctaa acagaccgaa 420 

aaagaaaaaa gccaagaagt ctccagcaag tcgggatcga tgaaggaccg gatatcggga 480 

agttcaaagc gttcgaatga ggagactgat gactatggga aggcgcaggt agccaaaagc 540 

agcagcaagg aatccaggtc atccaagctc cacaaggaga agaccaggaa agaacgggag 600 

ctgaagtctg ggcacaaaga ccggagtaaa agtcatcgaa aaagggaaac acccaaaagt 660 

tacaaaacag tggacagccc aaaacggaga tccaggagcc cccacaggaa gtggtctgac 720 

agctccaaac aagatgatag cccctcggga gcttcttatg gccaagatta tgaccttagt 780 

ccctcacgat ctcatacctc gagcaattat gactcctaca agaaaagtcc tggaagtacc 840 

tcgagaaggc agtcggtcag tcccccttac aaggagcctt cggcctacca gtccagcacc 900 

cggtcaccga gcccctacag taggcgacag agatctgtca gtccctatag caggagacgg 960 

tcgtccagct acgaaagaag tggctcttac agcgggcgat cgcccagtcc ctatggtcga 1020 

aggcggtcca gcagcccttt cctgagcaag cggtctctga gtcggagtcc actccccagt 1080 

aggaaatcca tgaagtccag aagtagaagt cctgcatatt caagacattc atcttctcat 1140 

agtaaaaaga agagatccag ttcacgcagt cgtcattcca gtatctcacc tgtcaggctt 1200 

ccacttaatt ccagtctggg agctgaactc agtaggaaaa agaaggaaag agcagctgct 1260 

gctgctgcag caaagatgga tggaaaggag tccaagggtt cacctgtatt tttgcctaga 1320 

aaagagaaca gttcagtaga ggctaaggat tcaggtttgg agtctaaaaa gttacccaga 1380 

agtgtaaaat tggaaaaatc tgccccagat actgaactgg tgaatgtaac acatctaaac 1440 

acagaggtaa aaaattcttc agatacaggg aaagtaaagt tggatgagaa ctccgagaag 1500 

catcttgtta aagatttgaa agcacaggga acaagagact ctaaacccat agcactgaaa 1560 

gaggagattg ttactccaaa ggagacagaa acatcagaaa aggagacccc tccacctctt 1620 

cccacaattg cttctccccc accccctcta ccaactacta cccctccacc tcagacaccc 1680 

cctttgccac ctttgcctcc aataccagct cttccacagc aaccacctct gcctccttct 1740 

cagccagcat ttagtcaggt tcctgcttcc agtacttcaa ctttgccccc ttctactcac 1800 

tcaaagacat ctgctgtgtc ctctcaggca aattctcagc cccctgtaca ggtttctgtg i860 

aagactcaag tatctgtaac agctgctatt ccacacctga aaacttcaac gttgcctcct 1920 

ttgcccctcc cacccttatt acctggaggt gatgacatgg atagtccaaa agaaactctt 1980 

ccttcaaaac ctgtgaagaa agagaaggaa cagaggacac gtcacttact cacagacctt 2040 

cctctccctc cagagctccc tggtggagat ctgtctcccc cagactctcc agaaccaaag 2100 

gcaatcacac cacctcagca accatataaa aagagaccaa aaatttgttg tcctcgttat 2160 

ggagaaagaa gacaaacaga aagcgactgg gggaaacgct gtgtggacaa gtttgacatt 2220 

attgggatta ttggagaagg aacctatggc caagtatata aagccaggga caaagacaca 2280 

ggagaactag tggctctgaa gaaggtgaga ctagacaatg agaaagaggg cttcccaatc 2340 

acagccattc gtgaaatcaa aatccttcgt cagttaatcc accgaagtgt tgttaacatg 2400 

aaggaaattg tcacagataa acaagatgca ctggatttca agaaggacaa aggtgccttt 2460 

taccttgtat ttgagtatat ggaccatgac ttaatgggac tgctagaatc tggtttggtg 2520 

cacttttctg aggaccatat caagtcgttc atgaaacagc taatggaagg attggaatac 2580 

tgtcacaaaa agaatttcct gcatcgggat attaagtgtt ctaacatttt gctgaataac 2 64 0 

™^?^ aaa tcaaacta 9 c agattttgga cttgctcggc tctataactc tgaagagagt 2700 

cgcccttaca caaacaaagt cat tact ttg tggtaccgac ctccagaact actgctagga 2760 

gaggaacgtt acacaccagc catagatgtt tggagctgtg gatgtattct tggggaacta 2820 
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ttcacaaaga agcctatttt tcaagccaat ctggaactgg ctcagctaga actgatcagc 2 880 

cgactttgtg gtagcccttg tccagctgtg tggcctgatg ttatcaaact gccctacttc 2940 

aacaccatga aaccgaagaa gcaatatcga aggcgtctac gagaagaatt ctctttcatt 3000 

ccttctgcag cacttgattt attggaccac atgctgacac tagatcctag taagcggtgc 3060 

acagctgaac agaccctaca gagcgacttc cttaaagatg tcgaactcag caaaatggct 312 0 

cctccagacc tcccccactg gcaggattgc catgagttgt ggagtaagaa acggcgacgt 3180 

cagcgacaaa gtggtgttgt agtcgaagag ccacctccat ccaaaacttc tcgaaaagaa 3240 

actacctcag ggacaagtac tgagcctgtg aagaacagca gcccagcacc acctcagcct 3300 

gctcctggca aggtggagtc tggggctggg gatgcaatag gccttgctga catcacacaa 3360 

cagctgaatc aaagtgaatt ggcagtgtta ttaaacctgc tgcagagcca aaccgacctg 3420 

agcatccctc aaatggcaca gctgcttaac atccactcca acccagagat gcagcagcag 3480 

ctggaagccc tgaaccaatc catcagtgcc ctgacggaag ctacttccca gcagcaggac 3540 

tcagagacca tggccccaga ggagtctttg aaggaagcac cctctgcccc agtgatcctg 3600 

ccttcagcag aacagatgac ccttgaagct tcaagcacac cagctgacat gcagaatata 3660 

ttggcagttc tcttgagtca gctgatgaaa acccaagagc cagcaggcag tctggaggaa 372 0 

aacaacagtg acaagaacag tgggccacag gggccccgaa gaactcccac aatgccacag 3780 

gaggaggcag cagcatgtcc tcctcacatt cttccaccag agaagaggcc ccctgagccc 3840 

cccggacctc caccgccgcc acctccaccc cctctggttg aaggcgatct ttccagcgcc 3900 

ccccaggagt tgaacccagc cgtgacagcc gccttgctgc aacttttatc ccagcctgaa 3960 

gcagagcctc ctggccacct gccacatgag caccaggcct tgagaccaat ggagtactcc 4020 

acccgacccc gtccaaacag gacttatgga aacactgatg ggcctgaaac agggttcagt 4080 

gccattgaca ctgatgaacg aaactctggt ccagccttga cagaatcctt ggtccagacc 4140^ 

ctggtgaaga acaggacctt ctcaggctct ctgagccacc ttggggagtc cagcagttac 4200^ 

cagggcacag ggtcagtgca gtttccaggg gaccaggacc tccgttttgc cagggtcccc 4260 

ttagcgttac acccggtggt cgggcaacca ttcctgaagg ctgagggaag cagcaattct 432 0 

gtggtacatg cagagaccaa attgcaaaac tatggggagc tggggccagg aaccactggg 43 80 

gccagcagct caggagcagg ccttcactgg gggggcccaa ctcagtcttc tgcttatgga 4440 

aaactctatc gggggcctac aagagtccca ccaagagggg gaagagggag aggagttcct 4500 

tactaaccca gagacttcag tgtcctgaaa gattcctttc ctatccatcc ttccatccag 4560 

ttctctgaat ctttaatgaa atcatttgcc agagcgaggt aatcatctgc atttggctac 4620 

tgcaaagctg tccgttgtat tccttgctca cttgctacta gcaggcgact taggaaataa 4680 

tgatgttggc accagttccc cctggatggg ctatagccag aacatttact tcaactctac 4740 

cttagtagat acaagtagag aatatggaga ggatcattac attgaaaagt aaatgtttta 4800 

ttagttcatt gcctgcactt actggtcgga agagagaaag aacagtttca gtattgagat 4860 

ggctcaggag aggctctttg atttttaaag ttttggggtg gggggttgtg tgtggtttct 4920 

ttcttttgaa ttttaattta ggtgttttgg gtttttttcc tttaaagaga atagtgttca 4980 

caaaatttga gctgctcttt ggcttttgct ataagggaaa cagagtggcc tggctgattt 504 0 

gaataaatgt ttctttcctc tccaccatct cacattttgc ttttaagtga acactttttc 5100 

cccattgagc atcttgaaca tacttttttt ccaaataaat tactcatcct taaagtttac 5160 

tccactttga caaaagatac gcccttctcc ctgcacataa agcaggttgt agaacgtggc 5220 

attcttgggc aagtaggtag actttaccca gtctctttcc ttttttgctg atgtgtgctc 5280 
tctctctctc tttctctctc tctctctctc tctctctctc tctgtctgtc tcgcttgctc 5340 
gctctcgctg tttctctctc tttgaggcat ttgtttggaa aaaatcgttg agatgcccaa 54 00 
gaacctggga taattcttta ctttttttga aataaaggaa aggaaattca aaaaaaaaaa 546 ^^ 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 55l^p 
<210> 6 



-in v 



<212> DNA 

<213> Homo sapiens 



<400> 6 

gaattctagg 

atgggcctct 

gccaagtgac 

tgtgtctcta 

caaggtccct 

ggcctccccg 

gtgtgtgctg 

cagaaatgtg 

acaccaggct 

tatttttcta 



cccagttctg 
gtaagatgag 
tcaagagtcc 
tttagaactt 

ggggagcctc 

tgctcatttt 
aaagtataat 
ggtctaaaac 
gtgattacct 
caccacaggg 



tgtttcccct 
gagttggaga 
caggcagcac 
aatccaacct 
ctagagggag 
tgtaccacca 
tgaagttgtg 
atacaaatgc 
gctcacatac 
tgaggaacag 



gtgtgttcct 
ggtacattct 
cagcacccct 
acccaacatc 
agagccctgc 
cagtgcttgg 
tatatatgtc 
tccaaatggg 
atgtgtctat 
gtatatgtgt 



aggcaggtca 
caggctactt 
atctccaagg 
agatcagtgt 
ccacccagat 
cacatggtag 
agctagagtg 
gtgtgggcaa 
ctgagtaggg 
gcatgtgtat 



gtttccctcc 
tcaactccca 
cctcctgatg 
cttaccagcc 
tgagggtaaa 
acatcaaaat 
tctggagggg 
gggtctgtct 
gtatgttatc 
gcatccgtgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gtgtgtgtat 
tgtgccattt 
aacagcaact 
tcttgggatt 
cttgagtctc 
taaccaagtc 
gggaggagga 
cccagatggg 
tacgccccca 
ccccacagac 
ctctgtcttt 
gggtggggga 
ctcgggccag 
gccaagggga 
ggtctggtcc 
tgagggagca 
tcccgtgagg 
ctttccctgc 
tatttaattt 
ggagccggct 
tgcccgtgac 
gatgccacac 
gaggagccgc 
ctcttctccc 
cctctccgag 
gcggcagctt 
tcattgggtg 
gggctgcaga 
cggaggggga 
ctgccacctt 
ccatcaatcc 
ggggaagggg 
cccacgcccc 
tcggctagag 
caaggggcgg 
gggaggggtg 
tgtccccacc 
cgagcgcgcc 
ccctgaccct 
ccggggccca 
gctgggccgg 
gagcccgcac 
gcccctcact 
gctgacccgc 
gggcgacggc 
gcctgcgccc 
tggagaagag 
agaggaggag 
caagaagcgc 
tcggcggcag 
ggacaacctg 
gacgctgcgc 
gcggccagac 
caatctggtg 
cgacggtgcc 
cccgtgctcg 
ggcgcacgcc 
aggcggtggc 
cccgctctgt 
aagctaccac 
agtcttcggc 
tatgcacctt 
ctgagacttc 
gcccccagca 
caccggtcct 
aggggcctcg 



gtgtgtgtgc 
gtatctgtgt 
tcctacctcc 
actggggttg 
atccaggaac 
tgaagaccag 
gagtgagagg 
tggtgcacag 
acaaccaagg 
acacacacgc 
cccttggttt 
ggggcagaca 
ggacgcccct 
ccctcctggg 
ctgacccacc 
tagaattggc 
gggtctctat 
gttgggcggg 
tttttttttt 
ctcccccagt 
gtgccaagcc 
tcgctccgcg 
tgcccccgct 
ccctctctct 
gtaagttgtc 
cggccgacag 
ggcatcggag 
ggaggtgtct 
tatgcgaaga 
tccccaccac 
caaaatacaa 
gtccctgacc 
gctgcttcag 
gagacgatcc 
cgcgcagcca 
agggagctgg 
ggcggccgac 
gcctcccctg 
gagaagaagc 
gcgccgggga 
ctctgggcgg 
gagactcggg 
ttgtgctgtc 
ctgttcagcg 
gaagacgacg 
gggccagggg 
gggacggagg 
gaggaagagg 
gggcccaaga 
aaggcgaacg 
cgcaaggtgg 
ctagccaaga 
ctagtgtcct 
gccggctgtc 
ggccgcttcc 
cgcctggcgg 
ctgcggaccc 
ggcgcgagcc 
ctcaatggca 
tactctatgc 
tcgtcggctg 
caccacgacc 
gcgccggctc 
gcgcagggta 
gccgctctcc 
cttcccccag 



atgagtgtgt 
ctggcccaat 
cttatcaaga 
gcttcagtat 
ttaggcagtt 
gagagaaagc 
tttctaggta 
gctcatctcg 
aaagatctcc 
ccatgcagag 
gaatttcgtt 
gggatgaggg 
tacagccatc 
gagtgggggc 
ttgggggcct 
caccacgagc 
atacaggggg 
gaggggaggg 
tttttggagt 
gcagacgcat 
catatggcct 
gttcgcatgg 
ccctgaagat 
cttgccccct 
cgaaagggag 
gagggtcctc 
gggccccagg 
ctgcctgcga 
ggccgagacg 
cgaaaaaagg 
cgagatctga 
gcagagggga 
tcctcgccgc 
tcccgcctct 
agttgcaaat 
gccggggggc 
cagcctctct 
gcaaagatat 
aggcgcgggg 
gagggcctgg 
ggcaggcggc 
ggtggaggag 
tgtctcccct 
agcccggcct 
agccgaggag 
ctccggggcc 
ccacgttggc 
aggaggaaga 
agcgcaagat 
cgcgggagcg 
tgccctgcta 
actatatctg 
acgtgcagac 
tgcagctcaa 
acggctcggg 
gcgcacagtg 
acggctactg 
cggactacaa 
acttctcact 
actactcggc 
tgcgcggggg 
gggg ccccat 

ccttcttttt 
cacccccatc 

tggggcagcg 
gggactcgcc 



gtgtgtgtgt 
gagagtgttg 
cagctgtctg 
tttcagattt 
ctcagcactg 
cgctggtgga 
ggaatccaga 
cggcccctcc 
catcggctga 
gcacagacat 
tagccacata 
atggcatggt 
ctgggagggg 
aagcacagag 
gcaggggagg 
ccccagtgtc 
caactcctcc 
cggcagaaat 
agagagtgac 
gccaatcacc 
ggcatagagg 
cgctctgaag 
gggggaacaa 
ccccccctcc 
cgagatctga 
aaatacctcc 
ttctctctcc 
tgggctcggt 
gaggacccct 
gaagcaaaca 
agagccttgt 
cggactgggc 
ccagagccgg 
ggaattgggg 
tggattaggg 
ccgggccgcg 
ccaccgccag 
ctggtcccta 
agcagccccc 
gccgagatcc 
ggaggtgggc 
ggttgtgggg 
tcccgcccgc 
tctctcggac 
cgacaagggc 
agcccgggcg 
cgaggtcaag 
aggactggac 
gaccaaggcg 
caaccgcatg 
ctccaagacg 
ggcgctctcg 
tctgtgcaag 
ctctcgcaac 
cggcccgttc 
ccaggcggcc 
cgccgcctac 
cagctccgag 
caagcaggac 
gctgcccggt 
cgtccactcg 
gtacgaggag 
cttttgcctt 
ctaccccggc 
cagtcctgtt 
ttctctctcc 



ccaaagccac 
aaaggtgagc 
acctacctcc 
ttcagaaggg 
cctgctcctc 
ctggtcacct 
cttagaccct 
ccactccacc 
ctccacagat 
ccaggcacat 
tgttgtgtgt 
gccaacatct 
gtctcagctg 
gtcctttctc 
aaatggacag 
cagccttgcc 
caccttcctc 
atttatttat 
agatggcggc 
gtctctcatg 
ctggtacccc 
acgccggcgc 
tgaaataagc 
cctcccctct 
cccgccggtt 
ttcctgggat 
cttaggggct 

ggggggggaa 

ccatggttgt 
aacaaatttg 
gggagggagt 
tcgcttctct 
ctccgggagc 
gtgcgggggt 
agcgtggggg 
agagcgcgga 
gagagaacgg 
aaacccccac 
cattcaagcg 
caggccggca 
atccagggta 
gggcgtcggt 
ggggcgccct 
gtgcccaagt 
gacgcgccgc 
gccaagccag 
gaggaaggcg 
gaggcggagg 
cgcttggagc 
cacgacctga 
cagaagctgt 
gagatcctgc 
ggtctgtcgc 
ttcctcacgg 
gccatgcacc 
ggcggcctgg 
gagacgctgt 
tacgagggcc 
tcctcgcccg 
tcgcgccacg 
gagaatctct 
ctcaatgcgt 
tgcccgcccc 
gccgggcgcg 
acctgtgggt 
ccaaggggtt 



ctcttcaacc 
cacaagataa 
ccttggccac 
gaggagaatg 
ctccctcaaa 
gtctggcagt 
cccctccacc 
ctaacatgga 
acacacatgt 
ctttcccttt 
gcgtgagggt 
acctatgggg 
tccctttgtg 
cccaacccgg 
agcgggaccc 
accccattgt 
tcaatccctg 
ttcctttatt 
gggtcccggg 
tgatagctgc 
gcctggtaga 
ccgccgcctt 
gagaagatcc 
ccccttgact 
gggaggaggg 
gatgcccccc 
gcagcccagg 
ggcaggatca 
cccaaaaagc 
gatttttccc 
cagcttgaag 
cagtctcctc 
tggggacgca 

gggggccgag 

tgagagccac 
gcggggcagc 
gctttcaggg 
ccggtccctg 
aggggcggag 
gccgggtagg 
gcctaggcag 
accccagcgc 
caggcaccat 
tcgccagctg 
caccgccacc 
tccctctccg 
agctgggggg 
gcgagcggcc 
gctccaagct 
acgcagccct 
ccaagatcga 
gctccggcaa 
agcccaccac 
agcaaggcgc 
cctacccgta 
gcggcggcgc 
atgcggcggc 
cgctcagccc 
accacgagaa 
gccacgggct 
tgtcttacga 
tttttcataa 
cctgtcccca 
gggagcgggc 
ggcctgtccc 
ccctcctcct 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 



BHC03 1 051 



-8- 

ctctcccaag gagtgcttct ccagggacct ctctccgggg gctccctgga ggcacccctc 4620 

ccccattccc aatatcttcg ctgaggtttc ctcctccccc tcctccctgc aggcccaagg 4680 

cgttggtaag ggggcagctg agcaatggaa cgcgtttccc cctctcatta ttattttaaa 4740 

aacagacacc cagctgccga ggcaaaaagg agccaggcgc tccctctttc ttgaagaggg 4800 

tagtattttg ggcgccggag cccgggcctg gaacgccctc acccgcaacc tccagtctcc 4860 

gcgttttgcg attttaattt tggcgggagg ggaagtggat tgagaggaaa gagagaggcc 492 0 

aagacaattt gtaactagaa tccgtttttc ccttttcctt tttttaaaca aacaaacata 4980 

caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagctaagag gcgacggaag ccgaacgcag 5040 

agtccggatc ggagagaaaa cgcagtaagg acttttagaa gcaataaaag gcaaaaaaaa 5100 

caaaaaacaa aaaaacaaac aaaaaaaaac cactactacc aataatcaaa gacacaaata 5160 

tctatgcaag gaggctccac tgagcctcgc ggcccggccc ggccccggga tgccccgccc 5220 

ggcctgcggg ccgccccgcc cgagcgcgga tctgtgcact ttggtgaagt gggggcccgc 52 80 

gccgccccct ccccctcccc aggttcttac aatcagtgac tcggagattt ggggccccag 5340 

tgccactgcc ctcccccgcc ccgtccccgt tgtgcgtcat gctgtttttt aaaaacctgt 5400 

ttccaaattt gtatggaatg gcaaactgtt ggggggtcgg tttggggagg gagggtttgc 54 60 

atgaaagaca cacgcacacc acaccgcacg cacaagcagg cccggcgccg gcgtccgggg 5520 

ggcagaagga ggtgagctcg ccggctcctc ctccccgcgg ccattctgtc ccctcctggg 5580 

gtgaggggtg gggatggaga cctgggggca gccccacccc tgcccggact gtgcctcggt 5640 

gggtgccacc tggcgatttc cggtgtctgg agagagtatt ttttggtcca aggagtcctc 5700 

ttggctttag ctggtgggtg ggcggggaga ggtctgaggg ctcctactgg aggttccccc 5760 

aaaaaggggc aaaaggagac cctctgccca ccggaggcag gggatcaggc atccaaatac 5820 

acgatgcaaa aatgcaatcc cacaggcgac acacccacac actcacccac acacacgcaa 5880 

ttttaccttc ctcttgtagc gaagatgaaa ctcccgtcgg acacccgaag tgcattgcgt 5940^ 

gtttctgttc agtttaatga cgattaataa atatttatgt aaatgagatg caaagccgga 6000 

ccggtttctc acggtggcct catttcattg aggggggaga gaaggtttga gctggggctg 6060 

gggtgatgaa ggcagagtgt caagtgactg tgcagaggcc aaacagaggg acttcccagc 6120 

aaaaagcact g . 6131 

<210> 7 

<211> 2020 

<212> DNA 

<213> Homo sapiens 



<400> 7 

gctactgagg ccgcggagcc ggactgcggt tggggcggga agagccgggg ccgtggctga 60 

catggagcag ccctgctgct gaggccgcgc cctccccgcc ctgaggtggg ggcccaccag 120 

gatgagcaag ctgcccaggg agctgacccg agacttggag cgcagcctgc ctgccgtggc 180 

ctccctgggc tcctcactgt cccacagcca gagcctctcc tcgcacctcc ttccgccgcc 24 0 

tgagaagcga agggccatct ctgatgtccg ccgcaccttc tgtctcttcg tcaccttcga 300 

cctgctcttc atctccctgc tctggatcat cgaactgaat accaacacag gcatccgtaa 360 

gaacttggag caggagatca tccagtacaa ctttaaaact tccttcttcg acatctttgt 42 i 

cctggccttc ttccgcttct ctggactgct cctaggctat gccgtgctgc agctccggca 480 

ctggtgggtg attgcggtca cgacgctggt gtccagtgca ttcctcattg tcaaggtcat 540 

cctctctgag ctgctcagca aaggggcatt tggctacctg ctccccatcg tctcttttgt 600 

cctcgcctgg ttggagacct ggttccttga cttcaaagtc ctaccccagg aagctgaaga 660 

ggagcgatgg tatcttgccg cccaggttgc tgttgcccgt ggacccctgc tgttctccgg 720 

tgctctgtcc gagggacagt tctattcacc cccagaatcc tttgcagggt ctgacaatga 780 

atcagatgaa gaagttgctg ggaagaaaag tttctctgct caggagcggg agtacatccg 840 

ccaggggaag gaggccacgg cagtggtgga ccagatcttg gcccaggaag agaactggaa 900 

gtttgagaag aataatgaat atggggacac cgtgtacacc attgaagttc cctttcacgg 960 

caagacgttt atcctgaaga ccttcctgcc ctgtcctgcg gagctcgtgt accaggaggt 102 0 

gatcctgcag cccgagagga tggtgctgtg gaacaagaca gtgactgcct gccagatcct 1080 

gcagcgagtg gaagacaaca ccctcatctc ctatgacgtg tctgcagggg ctgcgggcgg 1140 

cgtggtctcc ccaagggact tcgtgaatgt ccggcgcatt gagcggcgca gggaccgata 1200 

cttgtcatca gggatcgcca cctcacacag tgccaagccc ccgacgcaca aatatgtccg 1260 

gggagagaat ggccctgggg gcttcatcgt gctcaagtcg gccagtaacc cccgtgtttg 1320 

cacctttgtc tggattctta atacagatct caagggccgc ctgccccggt acctcatcca 13 80 

ccagagcctc gcggccacca tgtttgaatt tgcctttcac ctgcgacagc gcatcagcga 1440 

gctgggggcc cgggcgtgac tgtgccccct cccaccctgc gggccagggt cctgtcgcca 1500 

ccacttccag agccagaaag ggtgccagtt gggctcgcac tgcccacatg ggacctggcc 1560 

ccaggctgtc accctccacc gagccacgca gtgcctggag ttgactgact gagcaggctg 1620 

tggggtggag cactggactc cggggcccca ctggctggag gaagtggggt ctggcctgtt 1680 



BHC03 1 051 



gatgtttaca 
gcagggtctg 
ggcagcctgt 
ttttttagga 
tgggctgctg 

tgggggacct 

<210> 8 



tggcgccctg 
ggctgggcac 
cacccgtgtg 
ttattgaaag 
gccatgaatc 
ttgtattaag 



cctcctggag 
ctgacttggc 
aagatgaagg 
agtctgggac 
tctgcctctc 
ccaattaaaa 



gaccagattg 
tggggaggac 
ggctcttcat 
ccttgttggg 
ccaggctgtc 
acatgaattt 



ctctgcccca 
cagggccctg 
ctgcctgcgc 
gagtgggtgg 
cccctcctcc 



ccttgccagg 
ggcagggcag 
tctcgtcggt 
caggtggggg 
cagggcctcc 



1740 
1800 
I860 
1920 
1980 
2020 



<211> 1730 

<212> DNA 

<213> Homo sapiens 



<400> 8 

gtggtgaggg 

ttgcactctc 

gcaggacctg 

tctgtttaca 

ktgccagggg 

aggcacctgg 

cagcgtccag 

gctctactta 

agggggctat 

agagctgagc 

atggaaggat 

tccctgcctc 

tggagaaatt 

cacccttcct 

gagagcaaca 

aggaggacac 

agcgctcgcc 

agctgcccta 

ccaaggagga 

ccctgggtgg 

ctgtggtgcc 

aggaagcaca 

gctgggccct 

tagtttgccc 

ccctgagttc 

gccagcacag 

agaggtgcct 

taggcagaag 

cccccagcac 

<210> 9 



tgactgggga 
ccttctgggg 
gcagccctgg 
gacaasctgc 
ggcacaaggc 
aggcattgac 
gtaccccaac 
tagcatctga 
ttaaagggcc 
tgcgaggtgt 
ctgacactgt 
tgctccccca 
tctgggggtg 
gcccccaggt 
gctcccagga 
ccagagacat 
ctggctgatg 
ccagcgggta 
gcgtgaggac 
ccagtgtgtg 
tgtcagcaag 
gagaggctga 
tcctggctag 
agagttgggg 
ccaaagggag 
ctgaggaccc 
gggaggagtg 
aagctgggag 
cctctgttag 



ctaggcacta 
atatgccctt 
tacagagccc 
tgtcctccct 
tgggcatgtg 
cccaggacct 
ccctgccctg 
caccagaggg 
tgggagggga 
cggaggagaa 
ccacacggcc 
gagcaccctc 
ggggcaggaa 
cccagcagcc 
gctcactgcc 
gagacctacc 
atgcggatgg 
ctgccgctgc 
acccccatcc 
gaccgccagg 
cccggtgcmc 
gagggactgt 
gacctgtgga 
gctaggggag 
ggtggcagag 
tcagccccag 
gccctgatcc 
ggagggggag 
tgccgcaata 



ggcctttggt 
gagcccaggc 
agagggggca 
gcaaagggga 
gctggcatga 
tggaccccag 
ggtccggcgt 
gccgaaaata 
gagagaatga 
ctgtgagcgc 
cgaggagggg 
actgagccat 
gaatgcccca 
caggggagcc 
cctcccctct 
accagcaggg 
gcatcctcgg 
ccatcttcac 
agcttcagga 
aggtggctga 
ttcgtcgctc 
gacttgggct 
ggggcagctc 

gggggagcca 

acagtgggca 
gagaagggac 
aggaaaatgt 
gtgaaaaggg 
aatgctcaat 



gcaggcgcct 
agaggagagc 
tcagttcctg 
gtgggtgggg 
gacggtgtct 
acctctgacc 
ccccccatta 
gcccctggag 
ggagtgatca 
cgggaggcct 
tgagtgtggg 
gaggccagag 
tggggagagc 
ccccacccag 
ccccagctgc 
gcagtgccag 
ccgtgggctg 
ccctgccaag 
gctgctggcg 
gatcacaaag 
cctgtcccgc 
ccgctgtgcc 
gctggcccat 
gaggccagga 
ctaagggtgg 
aaaaggtact 
gaggggaatc 
cagaggcaag 
catgtgccag 



gaggacktgg 
acagcccagg 
ctggtcctgc 
cagagggcaa 
gagtaatgtc 
gtggggcagc 
gtgagtcttg 
aagggggagg 
tggctacctc 
tctgggcaga 
tctgctagag 
catgaagccc 
aaaggggaac 
cctgtgccca 
tccctgcatg 
gtgctggtgc 
caggagtacc 
atgggcgcca 
ctggagacag 
cagctgcccc 
tccatgtccc 
cgccccctgg 
ggctgctttg 
tgcctgagcc 
agagttgggg 
ggtgagggca 
tggaacgctc 
gatggtgggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1730 



<211> 3799 

<212> DNA 

<213> Homo sapiens 



<400> 9 

ctggcactgg gtggtaacca gcaagccagc tggcatccgc atccagggtt tgtttcaatg 60 

atgtctcgtg gagaatatgg aggggctggt gccaggactg tccttggctt tgcctcgggg 120 

tgtgaacggg gtcagtgacc tctaaaacta acctgcctct cagttctgaa tccagacaga 180 

atcaatcctc agctgtgtct cgctccacac cccctgccct ggaagccagg gaaggttgga 240 

ggtgctaggg ggtcaggctc ccctctgtga cccctgcagc tgttgtggtg actcatgtcc 300 

caacctagct gcctctccca aggagacttt cccctgggac aagggggagg gaatggcatg 360 

gaggaggccc acatcaagcg gggccaggaa cccacggtgg caggagctgg gctggtgacc 420 

tacccagggc agaagggccc gggactcatc cagaggggaa ggaaggggtc ttcaggaaga 480 

ccacggagat gccacaggca gaattggctt cccatctggg agataggtgg ggagaccctg 540 



BHC03 1 051 



- 10 



gcattttgac 
ttcggaggga 
aggaccccaa 
cctccgcgaa 
cgccgccccc 
agataagcgg 
gtgcattagc 
ctgctctggg 
cggggtgggg 
tggaggaggg 

acgggtgtcc 

gcgaggggga 
gaaggggcgg 
gggaggggcc 
gtggggggcg 
cgggagcaag 
tcacctccgg 
tgcggcgcac 
cggggcgggg 
cgcggcggca 
gccccgggcc 
cgcaacaact 
aagctgcgct 
ggacaagagg 
acagaccagg 
agaagagaaa 
gggggcggag 
gagatgcata 
ctcgctgagc 
tttatttact 
ggctggagta 
gattctcttg 
ctaatttatt 
aactcccaac 
agccactgcg 
gtgaagttcc 
caatgctttt 

gggagatgca 

gaccctcttc 
tcaggcccca 
acagatttcc 
gccttcaact 
ctggggggtg 
cctgagcgta 
gataaggagc 
ccccagcccc 
ttctgcttgg 
accacgctgc 
tacctggctg 
gccctggtgc 
cttcagacag 
ctgtgagggc 
agtggcacct 
ctgggaagca 
gctttttttc 
<210> 10 



agccagaacc 

agctggaggg 

atcggctggg 

tgcgtcctgc 

gaagtggcgg 

ctgggaaccc 

acagctcacc 

gcccccggag 

agggggtgtc 

gcggggaggg 

gggagggagg 

aactggggaa 

gctgctgggc 

ggccggcggg 

cccagaggcg 

tacggagccg 

tgtgtctgca 

atggccccgg 

gtcgggcggt 

gcatgagcgg 

aggcggcggt 

acgcgccccc 

gcttggcgca 

tcgtcgggga 

cgcctaacag 

aaggagcaac 

agtgctcaga 

ggaggaaatg 

ctggctggta 

tatttttatt 

caatggcgcc 

cctcagcttc. 

tgtattttta 

cttaggatcc 

cccggccagt 

cttccgaagg 

tccagatgga 

ggggagggaa 
ctctgccctg 
ccgtgtacca 
tggaggtcaa 
ggagcatgta 
agggttgggg 
gaacagcctt 
gccagctgcg 
tgggtgctgg 
aggctgtgag 
tgaggcctgg 
gggaggccag 
gtagtggcta 
gcgtagatga 
tgtacctggt 
aataaagaaa 
gcaagggccc 
cagaagctt 



tggggtgctg 
ttgggtgcga 
ccccaaggcc 
ccctcccctg 
gaacaacccg 
gcggggcccg 
tcccctatcg 
cgaccacagc 
gctggagggc 
ggagttgtgt 
ggagcctggc 
agggcctaat 
aaaagaggtg 
ggtgaggggg 
ggactcgggg 
ggggtgtggg 
gcccggacac 
gcggctcggc 
agaaaaaagg 
cgcagaccgt 
ggcttcggcc 
tcgcggggac 
gaccttcgcc 
gtgaaagcag 
atggggacca 
atcccttagg 
at tgagagct 
gaggcagatg 
ggtatagttg 
ttttatttgt 
atctcggctc 
cctagtagct 
gtagagacgg 
acccaccccg 
aggtatagtc 
gcagcccatg 
gacacatttg 
gggtaaggag 
cccaggtgaa 
gctgctcagt 
ccgccaggag 
cagccaacat 
aggaggcttc 
gagccctgcc 
agccagggtg 
gagcccagct 
cccagatctt 
gaqaca.cc tc 
gctgacggtg 
caaggtccgg 
tgtcaagggc 
gccctgtggc 
taataccctg 
agagatctga 



agcagaatct 
gaggagtggg 
cggactgcgc 
cccaagccct 
aacccgaacc 
caggggaggc 
cgcctgccat 
ggaggccgga 
ccggtggcat 
ggcagttcta 
ggggtctggg 
tccccagtct 
aatggctgcg 
tctaaagatt 
cggggcaggc 
ggacgattgc 
taagggagat 
ggtcagctgc 
gccgcgaggc 
agccccaatg 
taccagcgct 
ctgtgcaacc 
accggtgagc 
gcgcagggaa 
agaaacaaga 
agaggggcag 
aaggtggggg 
tgggacaggg 
ttttctttct 
tttgagacgg 
actgcaacct 
gggattacag 
gacttctcca 
gcctcccaaa 
ttctagatgt 
ttggagctgg 
cagaggagaa 
gcaggggctg 
gtgtccggac 
gcctgcagcc 
ctggggcgct 
gcctgcctca 
ccatagagtg 
ttgtgcctcc 
aaacgggtcc 
cccctgcctg 
gccagctttc 
ctcctcatcg 
gtgccagtgt 
gacctccgca 
gtcttcttcg 
ccccacccac 
ccgctgcggt 
gtgtccgggt 



tcatgcctgg 
gtcagagccc 
tccccggtgg 
ctgccctcac 
ttctgtcctc 
ccggctgttc 
cggacgggca 
acggactgtc 
agcaacggac 
agggaagggt 
gcctcgtcgc 
ccacctcgaa 
gggggctgga 

gtgggggtga 
gaggcggagg 
cgctgcagcc 
ggatgaatgg 
cgcccccaca 
gagcggggca 
cgggcgcagc 
tcgagccgcg 
cgaacggcgt 

gggggaaact 

ataaaaagaa 
gatagctgag 
aggagagaga 
atgcaggaca 
gtgagaaact 
ttttctttat 
agtttcgctc 
ccgcctcccc 
gcatgcgccc 
tgttggtcag 
gtgctgggat 
gaaacctgag 
gttcagtcta 
ggaagaacta 
cctgggctgg 
gcaccctcat 
actttgagga 
ggctgcagga 
ttgagggcaa 
gctggttggg 
tgcacaggga 
tgcccatcga 
ctgacgccct 
agcgggccct 
gggccctgga 
ctgaggagga 
cctatatcat 
cctgggctca 
ctggattccc 
cagtgctgtg 
aggggagaca 



cctggccgcc 
ctacatccgc 
ccccggcggc 
ccgggtccgg 
gggagccccc 
cgcccgctaa 
gtgccgcgcc 
ctttctgggg 
gagagaggcc 
gggtgctggg 
ggagggcgct 
tcaggaaaga 
gaagagagat 
ggaactgagg 
gcgagggctg 
gccgccccac 
gtggggagga 
gcggaccggt 
ctgggcggac 
ccctgactcg 
cgcctacctc 
cgggccgtgg 
gaggcacgag 
ggaaagggag 
aggtgcaaac 
ggtggagaga 
gactgaggtg 
ccaggatttc 
tttattttca 
ttgttgccca 
gggttcaagc 
ccatgcctgg 
gctggtctcg 
tacaggtgtg 
tctcagagcg 
actctggggc 
gagagaggca 
ctggcaccag 
cgacattggt 
catcaccatg 
ggagccgggg 
ggggtaagga 
gcaacagagg 
atgctggcag 
cgtgcaccag 
ggtctctgcc 
ggaccacatc 
ggagtcgtgg 
ggtgagggag 
gcctgcccac 
gaaggttggg 
tgttctttga 
tgtggctctc 
ttcaccctag 



600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860^ 
192CW 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
318C 
324< 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3799 



<211> 4530 



<212> DNA 



<213> Homo sapiens 



BHC03 1 051 



11 - 



<400> 10 
aattctcgag 
cccggccccc 
agccatgggg 
ctcctcctcg 
atgaagctgc 
cagggctgcc 
ctgtccttcc 
gtgaggcagg 
aactatgccc 
ggggcctccc 
ggaggggtct 
gacatcttcc 
gcctgccacc 
gattgtcaga 
ctgcccactg 
gactgcctgg 
ctggtcacct 
ttcggcgcca 
tgcaccctcg 
tgtgagaagt 
cgagaggtga 
tttgggagcc 
ccgctccagc 
tacatctcag 
atccggggac 
agctggctgg 
aacacccacc 
caagctctgc 
tgccaccagc 
tgcagccagt 
cccagggagt 
aatggctcag 
aaggaccctc 
atgcccatct 
acccactcct 
ctgacgtcca 
tttgggatcc 
ctgcaggaaa 
cagatgcgga 
tttggcacag 
gccatcaaag 
gcatacgtga 
acatccacgg 
cgggaaaacc 
aaggggatga 
gtgctggtca 
gacattgacg 
ctggagtcca 
actgtgtggg 
atccctgacc 
gtctacatga 
gagttggtgt 
aatgaggact 
gacgatgaca 
ttctgtccag 
tctaccagga 
cccaggtctc 
ggaatggggg 
cggtacagtg 
ctgacctgca 
tcgccccgag 
aagactctct 
gtggagaacc 
cctgccttca 
ggggctccac 



ctcgtcgacc 
acccctcgca 
ccggagccgc 
ccctcttgcc 
ggctccctgc 
aggtggtgca 
tgcaggatat 
tcccactgca 
tggccgtgct 
caggaggcct 
tgatccagcg 
acaagaacaa 
cctgttctcc 
gcctgacgcg 
actgctgcca 
cctgcctcca 
acaacacaga 
gctgtgtgac 
tctgccccct 
gcagcaagcc 
gggcagttac 
tggcatttct 
cagagcagct 
catggccgga 
gaattctgca 
ggctgcgctc 
tctgcttcgt 
tccacactgc 
tgtgcgcccg 
tccttcgggg 
atgtgaatgc 
tgacctgttt 
ccttctgcgt 
ggaagtttcc 
gtgtggacct 
tcgtctctgc 
tcatcaagcg 
cggagctggt 
tcctgaaaga 
tctacaaggg 
tgttgaggga 
tggctggtgt 
tgcagctggt 
gcggacgcct 
gctacctgga 
agagtcccaa 
agacagagta 
ttctccgccg 
agctgatgac 
tgctggaaaa 
tcatggtcaa 
ctgaattctc 
tgggcccagc 

tgggggacct 

accctgcccc 
gtggcggtgg 
cactggcacc 
cagccaaggg 
aggaccccac 
gcccccagcc 
agggccctct 
ccccagggaa 
ccgagtactt 
gcccagcctt 
ccagcacctt 



ggtcgacgag 
gcaccccgcg 
agtgagcacc 
ccccggagcc 
cagtcccgag 
gggaaacctg 
ccaggaggtg 
gaggctgcgg 
agacaatgga 
gcgggagctg 
gaacccccag 
ccagctggct 
gatgtgtaag 
cactgtctgt 
tgagcagtgt 
cttcaaccac 
cacgtttgag 
tgcctgtccc 
gcacaaccaa 
ctgtgcccga 
cagtgccaat 
gccggagagc 
ccaagtgttt 
cagcctgcct 
caatggcgcc 
actgagggaa 
gcacacggtg 
caaccggcca 
agggcactgc 
ccaggagtgc 
caggcactgt 
tggaccggag 
ggcccgctgc 
agatgaggag 
ggatgacaag 
ggtggttggc 
acggcagcag 
ggagccgctg 
gacggagctg 
catctggatc 
aaacacatcc 
gggctcccca 
gacacagctt 
gggctcccag 
ggatgtgcgg 
ccatgtcaaa 
ccatgcagat 
gcggttcacc 
ttttggggcc 
9999gagcgg 
atgttggatg 
ccgcatggcc 
cagtcccttg 
ggtggatgct 
gggcgctggg 
ggacctgaca 
ctccgaaggg 
gctgcaaagc 
agtacccctg 
tgaatatgtg 
gcctgctgcc 
gaatggggtc 
gacaccccag 
cgacaacctc 
caaagggaca 



ctcgagggtc 
ccccgcgccc 
atggagctgg 
gcgagcaccc 
acccacctgg 
gaactcacct 
cagggctacg 
attgtgcgag 
gacccgctga 
cagcttcgaa 
ctctgctacc 
ctcacactga 
ggctcccgct 
gccggtggct 
gctgccggct 
agtggcatct 
tccatgccca 
tacaactacc 
gaggtgacag 
gtgtgctatg 
atccaggagt 
tttgatgggg 
gagactctgg 
gacctcagcg 
tactcgctga 

ct gggcagtg 

ccctgggacc 
gaggacgagt 

tggggtccag 

gtggaggaat 
ttgccgtgcc 
gctgaccagt 
cccagcggtg 
ggcgcatgcc 
ggctgccccg 
attctgctgg 
aagatccgga 
acacctagcg 
aggaaggtga 
cctgatgggg 
cccaaagcca 
tatgtctccc 
atgccctatg 
gacctgctga 
ctcgtacaca 
attacagact 

gggggcaagg 

caccagagtg 
aaaccttacg 
ctgccccagc 
attgactctg 

a gggaccccc 

gacagcacct 
gaggagtatc 
ggcatggtcc 
ctagggctgg 
gctggctccg 
ctccccacac 
ccctctgaga 
aaccagccag 
cgacctgctg 
gtcaaagacg 
ggaggagctg 
tattactggg 
cctacggcag 



gacgagctcg 
tcccagccgg 
cggccttgtg 
aagtgtgcac 
acatgctccg 
acctgcccac 
tgctcatcgc 
gcacccagct 
acaataccac 
gcctcacaga 
aggacacgat 
tagacaccaa 
gctggggaga 
gtgcccgctg 
gcacgggccc 
gtgagctgca 
atcccgaggg 
tttctacgga 
cagaggatgg 
gtctgggcat 
ttgctggctg 
acccagcctc 
aagagatcac 
tcttccagaa 
ccctgcaagg 
gactggccct 
agctctttcg 
gtgtgggcga 
ggcccaccca 
gccgagtact 
accctgagtg 
gtgtggcctg 
tgaaacctga 
agccttgccc 
ccgagcagag 
tcgtggtctt 
agtacacgat 
gagcgatgcc 
aggtgcttgg 
agaatgtgaa 
acaaagaaat 
gccttctggg 
gctgcctctt 
actggtgtat 
gggacttggc 
tcgggctggc 
tgcccatcaa 
atgtgtggag 
atgggatccc 
cccccatctg 
aatgtcggcc 
agcgctttgt 
tctaccgctc 
tggtacccca 
accacaggca 
agccctctga 
atgtatttga 
atgaccccag 
ctgatggcta 
atgttcggcc 
gtgccactct 
tttttgcctt 
cccctcagcc 
accaggaccc 
agaacccaga 



agggcgcgcg 
gtccagccgg 
ccgctggggg 
cggcacagac 
ccacctctac 
caatgccagc 
tcacaaccaa 
ctttgaggac 
ccctgtcaca 
gatcttgaaa 
tttgtggaag 
ccgctctcgg 
gagttctgag 
caaggggcca 
caagcactct 
ctgcccagcc 
ccggtataca 
cgtgggatcc 
aacacagcgg 
ggagcacttg 
caagaagatc 
caacactgcc 
aggttaccta 
cctgcaagta 
gctgggcatc 
catccaccat 
gaacccgcac 
gggcctggcc 
gtgtgtcaac 
gcaggggctc 
tcagccccag 
tgcccactat 
cctctcctac 
catcaactgc 
agccagccct 

gggggtggtc 

gcggagactg 
caaccaggcg 
atctggcgct 
aattccagtg 
cttagacgaa 
catctgcctg 
agaccatgtc 
gcagattgcc 
cgctcggaac 
tcggctgctg 
gtggatggcg 
ttatggtgtg 
agcccgggag 
caccattgat 
aagattccgg 
ggtcatccag 
actgctggag 
gcagggcttc 
ccgcagctca 
agaggaggcc 
tggtgacctg 
ccctctacag 
cgttgccccc 
ccagccccct 
ggaaagggcc 
tgggggtgcc 
ccaccctcct 
accagagcgg 
gtacctgggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 



BHC03 1 051 



ctggacgtgc 
gagcagggaa 
caggggaacc 
cagatggctg 
gattctgagg 
ccctttcctt 
cggccctaag 
agtactgccc 
tgcttttctg 
caggggagaa 
ttgtccattt 
<210> 11 



cagtgtgaac 
ggcctgactt 
tgccatgcca 
gaaggggtcc 
ccctgcccaa 
ccagatcctg 
ggagtgtcta 
cccatgagga 
tttagttttt 
tgggtgttgt 
gcaaatatat 



cagaaggcca 
ctgctggcat 
ggaacctgtc 
agcctcgttg 
tgagactcta 
ggtactgaaa 
agaacaaaag 
aggaacagca 
actttttttg 
atggggaggc 
tttggaaaac 



- 12- 

agtccgcaga 
caagaggtgg 
ctaaggaacc 
gaagaggaac 
gggtccagtg 
gccttaggga 
cgacccattc 
atggtgtcag 
ttttgttttt 
aagtgtgggg 



agccctgatg 
gagggccctc 
ttccttcctg 
agcactgggg 
gatgccacag 
agctggcctg 
agagactgtc 
tatccaggct 
ttaaagacga 
ggtccttctc 



tgtcctcagg 
cgaccacttc 
cttgagttcc 
agtctttgtg 
cccagcttgg 
agaggggaag 
cctgaaacct 
ttgtacagag 
aataaagacc 
cacacccact 



<211> 2205 

<212> DNA 

<213> Homo sapiens 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4530 



<400> 11 
cacagggctc 
tgtagagaag 
gccagacttc 
acaactggtt 
cctcatctta 
ccccggcccc 
ccaaccaccg 
cccaacccct 
tccagtgcaa 
agtgaggatg 
tgtgaaatgc 
tgccaccccc 
caggctgcct 
tacgaactgt 
ctcgatgcac 
agctttcctg 
cgctttttct 
gatccgaggc 
cagggccgca 
cttcgaaatg 
tgctggctgg 
caggcacagt 
tcagataata 
ccccgggagg 
caccgcctca 
scccssctct 
cagggcttgg 
gtcctctctt 
gagggtcgcc 
ctcgtggagt 
acgcgggtgg 
gcccgccgcc 
ggagcgggag 
aaaacagccc 
ccccctctcc 
tgggggcagc 
ctgctctgtt 
<210> 12 



ccccccgcct 
cgggagtgga 
ggaacgggtg 
ccgttaagcc 
gcagctctcc 
ctgatacccc 
gcaggaaact 
tccctgagct 

gggggctgct 

gggcctgcag 
tggtgcagcg 
acctagcact 
ggcccgtggg 
tcaagagctc 
acactggtat 
agatccaggg 
gtttcttgcg 
acctgcagta 
agctctacgg 
gacacaaggg 
ctgccttccg 
ctcgccatct 
ccctggtggc 
ctctgagtgt 
gcctgcccat 
aat-tccacaq 



tagacggcct 
tgtgccacct 
tgtacttcag 
tccaccagct 
ccctctgacc 
cctccaccca 
ggttccgcca 
ccactccagt 
ttctcctagc 
ccaggcggtt 
ttatactagt 



ctgacttctc 
tctgaaataa 
tcctgctact 
cctctctcgc 
ggaagacctt 
tctgcctgag 
tcgagaggag 
ctgcagtcct 
cccccgcgat 
gtctgtggag 
agctcacgcc 
ggagcggggt 
cggagatagc 
cccacactcc 
atcccatgaa 
ctttctgcag 
ccgatctggc 
cgtggcagat 
gatgcccact 
gcttcggatc 
cctcttcaag 
gcatccatct 
catggacttc 
ggccctggag 
gccagcctcc 
gcgcatttcc 
gttcctggtc 
gcagaaagtg 
catggatgat 
gaaccgcggc 
aggccgtgga 
tccagtggac 
ctccagtttt 
ccactcctga 
tctggaggtg 
tcacgcccca 
gacaataaag 



tgtccgaagt 
aatccaggaa 
cctgctgggg 
tcagacgcca 
tggccagccc 
gaggtaaaga 
gagaggcgtg 
ccctcacaga 
gccagccgcc 
gtggcagcag 
ttgagcgacg 
ttggaggacc 
cgcttcgtct 
ctgttcccag 
gacctcatcc 
ctgcggggtt 
ctctattact 
gtgaacgagt 
gacttcggtt 
ttctgcagtg 
tacggggtgc 
tgtttgggct 
tctggccatg 
gaggcccagg 
ggcacgagcc 
cgtgaggaga 
cgggagagtc 
aagcattatc 
ggccagaccc 
atcctgccgt 
ctggctcatg 
tctggggcgc 
ctcctctgct 
cccctctcct 
ctgctctagg 
cactttgtac 
attatttttt 



cgggacaccc 
tctgggggtt 
ctcctccagg 
tggagctgga 
ctgggacccc 
ggtcccagcc 
ccacctccct 
gcccaattct 
cccatgtagt 
gtgccacagc 
agacctgggg 
acgagtccgt 
tccggaaaaa 
aaaaaatggt 
agaacttcct 
caggacggaa 
ccaccaaggg 
ccaacgtgta 
tctgtgtcaa 
aagatgagca 
agctgtacaa 
ccccaccctt 
ctgggcgtgt 
cctggaggaa 
tcagtgcagc 
gccagcggct 
agcggaaccc 
tcatcctgcc 
gcttcactga 
gcttgctgcg 
cctcagcccg 
ggccacaggg 
tctttgcctc 
caagggaagg 
gcagggaatt 
agaccgagag 
gatac 



tcctaccacc 



cctagacgga 
acaagggcac 
tctgtctcca 
tcctgggact 
tctcctcatc 
cccctctatc 
cgggggcccc 
aaaggtgtac 
tcgccacgtg 
gctggtggag 
ggtggaagtg 
cttcgccaag 
ctccagctgt 
gaatgctggc 
gctttggaaa 
cacctctaag 
cgtggtgacg 
gcccaacaag 
gagccgcacc 
gaattaccag 
gagaagtgcc 
cattgagaac 
gaagacaaac 
catccaccgc 
tattggacag 
ccagggctcc 
gagcgaggag 
cctgctgcag 
ccattgctgc 
ccttcaggct 
gacgggatga 
cctcagatag 
ccttgggtgg 
atgggagaag 
gccagttgat 



60^ 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
138A 
144^^ 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2205 



<211> 2177 



<212> DNA 



<213> Homo sapiens 



BHC03 1 051 



- 13- 



<400> 12 

gaattcgcgg 

ctcgtgtgca 

gcctcggtcc 

gcgccgcggc 

gccgccggcc 

gggggaggca 

cacagtcacc 

caaggagccg 

caaccactat 

agactttttg 

tccccgcacg 

actcctcgtg 

tgatgatctg 

aaagtgttac 

cagcttcttg 

caccctgttg 

gaagctggtg 

cctctactac 

gatgaccaac 

gcacaagctt 

ccgccagccc 

gacaggaaac 

agatgggacc 

catgatcagc 

ggatagcccc 

ggccagcatc 

cacccgagag 

catgtctgtc 

agaggaacgg 

tgatgatgac 

gctctttccc 

tcatatgctg 

cagggctcct 

tccccgggca 

ctcctttagg 

tcgattatga 

aaagcggccg 

<2X0> 13 



ccgctggttt 
ggcccggctc 
ccggactagg 
gcggacaagg 
ccccaggatg 
gacggcaaga 
ttggaggaca 
agattcgtgc 
gttctgtata 
ctccccttcc 
ggaaaagctg 
gtcatcttca 
atgcagaaga 
tattatcacg 
catgctcggc 
aacctcctgc 
tccaagtctg 
acagggcgaa 
gcccttcgca 
ctcatcgtgg 
tccctcaagc 
ctagccaagt 
tacaccctaa 
ctctcctatt 
gaagatgcag 
aaccacgaga 
ccccagctag 
aaggccatga 
cgtgagcgag 
agcttccctt 
ccttgggggt 
cattcgtgca 
ccc cage egg 
gggtcctggc 
gagtggggga 
tttttaaaca 
cgaattc 



gcagctgctc 
ggctcctggt 
ccgtgacccc 
cgaaaccgcc 
tggagatgaa 
cggcggcggc 
tcaaggagca 
tgcgggccct 
aggctgtgca 
tggaagagee 
cgtcgacacc 
tgatgaacag 
tcagtactca 
cccgggtcta 
tccggacagc 
tgeggaatta 
tgttcccaga 
teaaagecat 
aggcccctca 
tggagctgtt 
gctcactcat 
tcaaccaggt 
ttatcegget 
cccgaatctc 
agttcattgt 
agggctatgt 
ccttccacca 
ggtttcctcc 
aacagcagga 
gagctggggg 
cccctgccca 
gggggtgggg 

tgacttactg 
cagcagtgtg 
ctagaactgg 
ataaaaagtt 



cgtcatcgtg 
ccccggtgcg 
gggtgccatg 
gcccggcgga 
agaggaggca 
ageggttgag 
cgtgaaacag 
gcggatgctg 
gggcttcttc 
catggacaca 
cctcctgcct 
caagcgctac 
gaaccgccgg 
tgagttcctg 
tacgettegg 
cctacactac 
gcaggccaac 
ccagctggag 
gcacacagct 
gctgggggag 
gccctatttc 
cctggatcag 
gcggcacaac 
cttggctgac 
tgccaaggcc 
ccaatccaag 
gcgcatctcc 
caaategtae 
cttggagttt 
gctggggagg 
gggcactgtc 
gtgctgggag 
tacagcaggc 
ggagcaggag 
gatgtcttgg 
ctcccaaaaa 



cggcccgacg 
agggttaacg 
aagcaggagg 
ggagaacaag 
gcgacgggtg 
cactcccagc 
ctagagaaag 
ccttccacat 
acttcaaata 
gaggctgatt 
gaagtggaag 
aaagaggcac 
gccctagacc 
gacaagctgg 
catgaegcag 
agcttgtacg 
aacaatgagt 
tactcagagg 
gteggcttea 
atccctgacc 
cttctgactc 
tttggggaga 
gtgattaaga 
atcgcccaga 
ateegggatg 
gagatgattg 
ttctgectag 
aacaaggact 
gecaaggaga 
ggtaggggga 
cccattttcc 
ccagccaccc 
aggagggtgg 
gggaaggata 
cttgtatgtt 
aaaaaaaaaa 



ctatctcgcg 
cgaggccccg 
gctcggcgcg 
aacccccacc 
gegggtcaac 
gagagctgga 
eggtttcagg 
cacgccgcct 
atgccactcg 
tacagttccg 
cctatctcca 
agaagatctc 
ttgtagccgc 
atgtggtgcg 
aegggcagge 
accaggctga 
gggccaggta 
cccggagaac 
aacagaeggt 
ggctgcagtt 
aagctgtcag 
agtttcaagc 
caggtgtacg 
agetgeagtt 
gtgtcattga 
acatctattc 
atatccacaa 
tggagtctgc 
tggcagaaga 
atggggacag 
cacacacagc 
tgacctcccc 
gcaggcaacc 
gttctgtgta 
ttttgaagct 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
.1920 
1980 
2040 
2100 
2160 
2177 



<211> 2960 

<212> DNA 

<213> Homo sapiens 



<400> 13 

ctgccgcttc caggegtcta tcagcggctc agcctttgtt cagctgttct gttcaaacac 60 

tetggggeca ttcaggcctg ggtggggcag egggaggaag ggagtttgag gggggcaagg 120 

egaegtcaaa ggaggatcag agattccaca atttcacaaa actttegcaa acagcttttt 180 

gttccaaccc ccctgcattg tcttggacac caaatttgea taaatcctgg gaagttatta 240 

etaagectta gtcgtggccc caggtaattt cctcccaggc ctccatgggg ttatgtataa 3 00 

agggccccct agagctgggc cccaaaacag cccggagcct gcagcccagc cccacccaga 360 

cccatggctg gacctgccac ccagagcccc atgaagctga tgggtgagtg tcttggccca 420 

ggatgggaga gccgcctgcc ctggcatggg agggaggctg gtgtgacaga ggggctgggg 4 80 

atccccgttc tgggaatggg gattaaaggc acccagtgtc cccgagaggg cctcaggtgg 540 

tagggaacag catgtctcct gagcccgctc tgtccccagc cctgcagctg ctgctgtggc 600 

acagtgeact ctggacagtg caggaageca cccccctggg ccctgccagc tccctgcccc 660 

agagcttcct gctcaagtgc ttagagcaag tgaggaagat ecagggegat ggcgcagcgc 720 

tccaggagaa gctggtgagt gaggtgggtg agagggctgt ggagggaagc ccggtgggga 780 

gagctaaggg ggatggaact geagggecaa catcctctgg aagggacatg ggagaatatt 840 



BHC03 1 051 



14- 



aggagcagtg 
cagtgctcgg 
agggcccctg 
aggacatgga 
ccggccacgg 
gccaccccga 
gcagctgccc 
gagggggaag 
gggcctgacg 
gcttgagcca 
ggatctcccc 
ccaccaccat 
ttctgggcac 
ttcctcattt 
acagatggaa 
cttcgcctct 
cttcctggag 
cccatcccat 
acagggaaga 
attctcctgc 
tagataggta 
gggcactggg 
agagtatcag 
gttccccatc 
tgcatcccct 
gccctggggt 
atggtttgac 
cctgccctgc 
gacatttgcc 
gtcaggcctg 
accagtgtcc 
ccagtcacgt 
ggctgaaggg 
ggaggaggag 
gagcacctac 
gacatggaat 
<210> 14 



gagctgggga 
gagggctggc 
tgagatcaga 

999 a 9999aa 
cgagtctcac 
ggagctggtg 
cagccaggcc 
gagaggagga 
tatctcaggc 
actccatagc 
cgagttgggt 
ctggcagcag 
cacagccggg 
gtaataacgc 
gaactgggaa 
gctttccagc 
gtgtcgtacc 
gtatttatct 
gcagaacgga 
ctgtagcagt 
aataccaagt 
atgagccgct 
gtctcccacg 
tgggtccttg 
tggctgtgag 
cccacgaatt 
tcccgaacat 
ccccacgagg 
ttgctggatg 
tgtgtgaaag 
cctccactgt 
ccttcctcct 
tgggagaggc 
gaaagttctc 
tctgtgcaga 
ctgcactcga 



aggctgggaa 
tgggatggga 
gagtgggggt 
agaccagaga 
tcagcatcct 
ctgctcggac 
ctgcagctgg 
acacccatgg 
agcaccccct 
ggccttttcc 
cccaccttgg 
gtgagccttg 
cctgtgtatg 
ccactcagaa 
tggcccctgc 
gccgggcagg 
gcgttctacg 
ctatttaata 
gccccaggcc 
gagaaaaagc 
atttattact 
gtgagcccct 
tgggagacaa 
cacccctcac 
gcccctggac 
tgctggggaa 
caccgacgtg 
gtcaggactg 
gggactgggg 
gaagctccac 
cacattgtaa 
tcttgagtcc 
cagagggagg 
aagttcgtct 
cgctgggcta 



gggacttggg 
gtggaggcat 
gcagggcaga 
gtcggggagg 
tccatcccca 
actctctggg 
tgagtgtcag 
gctcccccat 
aactcttccg 
tctaccaggg 
acacactgca 
ttgggcaggg 
ggccctgtcc 
gggcccaacc 
cctgcagccc 
aggggtcctg 
ccaccttgcc 
tttatgtcta 
tctgtgtcct 
tcctgtcctc 
atgactgctc 
ggtcctgagg 
gaaatccctg 
tctggcctca 
aagcagaggt 
tctcgttttt 
tctcctgttt 
tgactctttt 
atgtgggagg 
tgtcaccctc 
ctgaacttca 
agctggtgcc 
tcggggagga 
gacattcatt 
agtgctgggg 



gaggaggacc 
cacattcagg 
gaggaactga 
acccgggaag 
gtgtgccacc 
catcccctgg 
gaaaggataa 
gtctccaggt 
ctctgtctca 
gctcctgcag 
gctggacgtc 
tggccaaggt 
atgctgtcag 
actgatcaca 
acccagggtg 
gttgcctccc 
cagccctgag 
tttaagcctc 
tccctgcatt 
ccatcccctg 
cccagccctg 
gtccccacct 
tttaatattt 
gccgactgca 
ggccagagct 
cttcttaaga 
ttctgggtgg 
tagggccagg 
gagcagacag 
cacctcttca 
ggataataaa 
tggccagggg 
ggtctgggga 
ccgttagcac 
acacagcagg 



ttggtgggga 
agaaagggca 
acagcctggc 
gagcggcgac 
tacaagctgt 
gctcccctga 
ggctaatgag 
tccaagctgg 
caggcaggct 
gccctggaag 
gccgactttg 
cgtgctggca 
cccccagcat 
gctttccccc 
ccatgccggc 
atctgcagag 
ccaagccctc 
atatttaaag 
tctgagtttc 
gactgggagg 
gctctgcaat 
gggacccttg 
aaacagcagt 
cagcggcccc 
gggaggcatg 
cttttgggac 
cctcgggaca 
caggtgcctg 
gaggaatcat 
ccccccactc 
gtgtttgcct 
ctggggaggt 
ggaggtccag 
atatttatct 
gaacaaggca 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
222o( 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2960 



<211> 850 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> mis cofeature 

<222> (3) . . (4) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (9) . . (9) 

<223> n=a, c, g or t 



<220> 
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<221> misc_f eature 
<222> (ii) . . (ii) 
<223> n=a, c, g or t 

<220> 

<221> misc_f eature 

<222> (18).. (18) 

<223> n=a # c, g or t 

<220> 

<221> misc_f eature 

<222> (202) . . (202) 

<223> n=a, c, g or t 

<220> 

< 2 2 1 > mi s cofeature 

<222> (205) . . (205) 

<223> n=a, c, g or t 

<220> 

<221> misc^feature 

<222> (273) . . (273) 

<223> n=a, c, g or t 

<220> 

< 2 2 1 > mi sc_f eature 

<222> (327) . . (327) 

<223> n=a, c, g or t 

<220> 

<221> misc_f eature 

<222> (367) . . (367) 

<223> n=a, c, g or t 

<220> 

<221> misc_f eature 
<222> (581) . . (581) 
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<223> n*=a, c, g or t 
<220> 

<221> misc_feature 
<222> (599) . . (599) 

<223> n=a, c, g or t 

<220> 

<221> mis cofeature" 

<222> (628) . . (628) 

<223> n=a, c, g or t 

<220> 

<2 2 1 > mi s c_f ea ture 

<222> (673) ; . (673) 

<223> n=a, c, g or t 

<220> 

<22l> misc_feature 

<222> (675) . . (675) 

<223> n=a, c, g or t 

<220> 

<221> misc_feature 

<222> (682) . . (682) 

<223> n=a, c, g or t 

<220> 

<221> misc_feature 

<222> (693) . . (693) 

<223> n=a, c, g or t 

<220> 

<221> mis cofeature 

<222> (698) . . (698) 

<223> n=a, c, g or t 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 



misc_feature 
(700) . . (700) 
n=a, c, g or t 



misc_f eature 
(720) . . (720) 
n=a, c, g or t 



misc_f eature 
(730) . . (730) 
n=a, c, g or t 

mi sc_f eature 
(734) . . (734) 
n=a, c, g or t 

misc_f eature 
(742) . . (743) 
n=a, c, g or t 

mi s cofeature 
(746) . . (746) 
n=a # c, g or t 

misc_f eature 
(748) . . (748) 
n=a, c, g or t 
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- 18- 

<221> misc_f eature 

<222> (752) . . (752) 

<223> n=a, c, g or t 

<220> 

< 2 2 1 > mis cofeature 

<222> (762) . . (762) 

<223> n»a, c, g or t 

<220> 

<2 2 1 > misc_f eature 

<222> (767) . . (767) 

<223> n=a, c, g or t 

<220> 

<221> misc_f eature 

<222> (777) . . (777) 

<223> n=a, c, g or t 

<220> 

<2 2 1 > misc_f eature 

<222> (783) . . (784) 

<223> n=a, c, g or t 

<220> 

<221> misc_f eature 

<222> (789) . . (789) 

<223> n=a, c, g or t 

<220> 

<221> misc_feature 

<222> (794) . . (794) 

<223> n=a, c, g or t 



<220> 
<221> 
<222> 



mis cofeature 
(797) . . (798) 
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<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



n=a, c, g or t 



mi sc_f eature 
(803) . . (805) 
n=a, c, g or t 



misc_f eature 
(810) . . (810) 
n=a, c, g or t 



misc_f eature 
(817) . • (817) 
n=a, c, g or t 



mis cofeature 
(826) . . (827) 
n=a, c, g or t 



mis cofeature 
(831) . . (832) 
n=a, c, g or t 



mi s c_f e ature 
(834) . . (834) 
n=a, c, g or t 



misc_f eature 
(837) . . (838) 
n=a, -C, g or t 



BHC 03 1 051 



-20 



<220> 

< 2 2 1 > mi s c_ f ea tur e 
<222> (840) . . (840) 
<223> n=a, c, g or t 

<220> 

< 2 2 1 > mi s c_f ea t ur e 
<222> (844) . . (844) 
<223> n=a, c # g or t 

<220> 

<221> misc_f eature 

<222> (846) . . (848) 

<223> n=a, c, g or t 



<400> 14 

ttnnctttnt 

ggaccctctg 

tcgcaccgct 

cctggagcag 

actggtgaag 

cctgcccctg 

tggccgnagt 

gaaagatgag 

ccagttcccc 

tttcctctcc 

ggtcacccct 

ttggatggaa 

gggcaatatn 

aanntcttna 

tccncnnncc 

<210> 15 



ngccatgncc 
aaccgagtcc 
ggcccccaca 

ggtggccgtg 
gtgtcagcca 
ggccgccagg 

gggggctggg 
gcatcgtgcc 
gctcctgact 
atcttgctgg 
gtgttgcgcc 
agngngtttt 
ttangggaat 
tccnctnnct 



agttcaactc 
tggccaacct 
cccagttcgt 
gnagngtcct 
tgtctagccc 
tggctgntaa 
gactggcgca 
tcacatccgt 
cctgactcca 
caactgagga 
cgcccagncc 
tntgtgttgt 
cnnccntncg 
ttnnnctccn 



agcctctcag 
gttcctgctc 
gcagtggttc 
gcagttcatg 
canggtggtt 
agccattgct 
gccccaggcg 
tccacatggt 
ggatgtctcc 
gagtgagcag 
aggagtagtc 
gtntgtgnan 
cncatttttt 
tcacctncct 



ttccacacgg 
atctcctcca 
atggaggagt 
cccttcacca 
ctggccatca 
gcactctgag 
cctccaaggg 
gcaagagcct 
cggtttcttc 
nctggaccac 
ttacctcttg 
gtgtttttcg 
cnttagagct 
tctttnntct 



<211> 2309 

<212> DNA 

<213> Homo sapiens 



acaacatgcg 
tcctggggtc 

gtgtggactg 

ccgtgtcgga 
cggacctcag 
gggcttggca 
aagcagtgag 
ctagcggctt 
ttttcaaaat 
aagcccagng 
aggaactttc 
gggttttttn 
ccccggngga 
nntnttnncn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
85CL. 



<400> 15 

ccccgggcgc aggaggcggg cggcccggcc ccaccggccc cccatggacg cccccagcac 60 

ggggcgctga gacccccgcg tcgctgccca gcccggtccg gcgcgccacg ccagggatct 120 

ctggacagga caagactccg aagctactcc cccagcacac agcccgggac ccacaaaccc 180 

agcttgcccc cagccctccc acctgccact ccctggcccc tcccaccgcc cgcccccctt 240 

ggggcgcagg gcatggtgtg aaaggccaag tgctgaggcg ggtatcatgg gtgctgtgcc 300 

ctagggcctg ggtggcaggg ggtgggtggc ctgtgggtgt gccggggggg ccagtgtgcc 3 60 

caccccagtc tcttggcgtg ctggagggca tcctggatgg aattgaagtg aatggaacag 420 

aagccaagca aggtggagtg tgggtcagac ccagaggaga acagtgccag gtcaccagat 480 

ggaaagcgaa aaagaaagaa cggccaatgt tccctgaaaa ccagcatgtc agggtatatc 540 

cctagttacc tggacaaaga cgagcagtgt gtcgtgtgtg gggacaaggc aactggttat 600 

cactaccgct gtatcacttg tgagggctgc aagggcttcfc ttcgccgcac aatccagaag 660 
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aacctccatc 
cgcaatcagt 
ttggttctag 
cggcggcgga 
gagtgggatc 
cattggaaac 
atgccggacg 
ccggccatca 
tgcgaagacc 
gctgtccgct 

cgggagcagc 

aagtcactct 
ctaatgtcaa 
gcgtacctgc 
tggcccaagc 
gcggcagaag 
ggaatgcatg 
ggaagtctcc 
ctggagctgc 
agcagtgagg 
gcaggcaatg 
gccatacctt 
gggctcagca 
caaacatggc 
ccccctttct 
ctccttcccc 
actgtgacac 
gactgtgtct 
<210> 16 



ccacctattc 

gccagctgtg 

atgactcgaa 

aggaggagat 

tgatccacat 

agaggcggaa 

gagacaaggt 

cccgtgtggt 

agatcatcct 

acgaccctga 

tcaagaatgg 

ctgcctttaa 

cagaccgctc 

tggcgttcga 

tgctgatgaa 

gccggccggg 

ttgttcaggg 

aagggccggt 

tccaccgaag 

cggactcccc 

cagcctctcc 

ctcccaggcc 

gggctggtca 

cagactccct 

ctgcctggca 

agctccccca 

ttatctcact 

gaatcatgtc 



ctgcaaatat 

ccgcttcaag 

gcgggtggcc 

gatccgatca 

tgccacagag 

attcctgccc 

ggacctggaa 

ggactttgcc 

cctgaagggg 

gagcgacacc 

cggcctgggc 

cctggatgac 

gggcctgctg 

gcactacgtc 

ggagagagaa 

cgggtcactg 

tccgcaggtc 

tcttcagcac 

cggaattctc 

gagctcctct 

ctgaagcccc 

tctgccccaa 

cctcccatcc 

tgctttttgc 

acatcttact 

ggcagaaata 

gttttataat 

tgtatcccg 



gacagctgct 

aagtgcatcg 

aagcgtaagc 

ctgcagcagc 

gcccatcgca 

gatgacattg 

gccttcagcg 

aaaaaactgc 

tgctgcatgg 

ctgacgctga 

gtagtctccg 

acggaagtgg 

tgtgtggaca 

aaccaccgca 

gtgcagagtt 

ggcgtccacc 

cggcagcttg 

cagagcccga 

catgcccgag 

gaggaggaac 

ccagaaggcc 

gagcaggagg 

cgtaagacca 

tgtgtagttc 

tgtcctttga 

gttgtctgtg 

tagtcgggca 



gtgtcattga 

ccgtgggcat 

tgattgagca 

gaccagagcc 

gcaccaatgc 

gccagtcacc 

agtttaccaa 

ccatgttctc 

agatcatgtc 

gtggggagat 

acgccatctt 

ctctgctgca 

agatcgagaa 

aacacaacat 

cgattctgta 

cggaaggaca 

agcagcagct 

agagcccgca 

cggtctgtgg 

cggaggtctg 

gatggggaag 

tgcctgaaag 

ccttcccttc 

cctctgcctg 

ggccccaact 

cttccttggt 

tgagtctgtt 



caagatcacc 

ggccatggac 

gaaccgggag 

cactcctgaa 

ccagggcagc 

cattgtctcc 

gatcatcacc 

cgagctgcct 

cctgcgggcg 

ggctgtcaag 

tgaactgggc 

ggctgtgctg 

gagtcaggag 

tccgcacttc 

caagggggca 

gcagcttctc 

tggtgaagcg 

gcagcgtctc 

ggaagacgac 

cgaggacctg 

gagaaggagt 

ctgggagcgt 

ctcagcaggc 

ggatgccctt 

caagtgtcac 

tcatgcttct 

tcccaagcta 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
. 2280 
2309 



<211> 2355 

<212> DNA 

<213> Homo sapiens 



<400> 16 

ccgttgcctc 

tgcagggtgc 

gcccaggggg 

accgctctca 

catcacctac 

tagtgacaac 

accatccccc 

cagcctgagt 

cttctataat 

gtcccccagc 

agtgtgtggg 

gggctttttc 

gaattgctcc 

tctctctgtg 

gcagcggatg 

ccagtgcccg 

gccaccccct 

gacgcctccc 

ccgggcccat 

cttcaatgcc 

tcagggctgc 

ggccctaaat 

tgcacaccac 

gtatgcagcc 

tgttctgctg 

ggagatctgg 

tgccaaacac 



aacgtccaac 

ttcggatcct 

caacagcggc 

gctggtgaag 

attggctcca 

tccaatggca 

actggctccc 

gatgacggct 

gggagccccc 

aagagcacca 

gacgttgcct 

cgtcggagca 

atcgtccgca 

ggcatgtctc 

cttgctgaga 

ctggagactt 

gctccggtcc 

agatccccaa 

cgagagatct 

aaccatgcat 

ccacctgccc 

ggtctgcgcc 

agctgccacc 

ccagaaggca 

gcatgtccta 

gaggatttct 

atcccgggct 



ccttctgcag 

gatcgtgagt 

gatcgcaacc 

acatgaccac 

gtggctcctc 

gcttccagtc 

tcacccaaga 

ccccttcttc 

ctgggagtct 

gcaacatcac 

cgggcttcca 

tccagcagaa 

tcaatcgcaa 

gagacgctgt 

tgcagagtgc 

cacccaccca 

cctcacccct 

gccctgagcc 

tcacctacgc 

caggtagccc 

ccaatgacaa 

aggctccctc 

agtccaacag 

aggcacctgc 

tgaacatgta 

ccatgagctt 

tccgtgacct 



ggctgcagtc 
cgcggggtcc 
tctagtttga 
cctggactcc 
cccaagccgc 
cctgacccaa 
cccggctcgc 
ctcatcttcc 
acaagtggcc 
caagctgaat 
ctacggtgtg 
catccagtac 
ccgctgccag 
gcgttttggg 
catgaacctg 
gcaccccacc 

ggtgggcttc 

cacagtggag 
ccatgacaag 
tccagccacc 
caacaccttg 
ctcctaccct 
caacgggcac 
caacagtccc 
cccgcatgga 
cacgcccgct 
ttctcagcat 



cggccacccc 

actccccgcc 

gtcaaggtcc 

aacaacaaca 

accagccctg 

ggctgtccca 

tcctttggga 

tcgtcgtcat 

atggaggaca 

ggcatggtgt 

ctcgcctgcg 

aaaaggtgtc 

caatgtcgct 

cgcatcccca 

gccaacaacc 

ccaggcccca 

tcccagtttc 

gatgtgatat 

ctgggcagct 

accccacatc 

gctgcccagc 

cccacctggc 

cgtctatgcc 

cggcagggca 

cgcagtgggc 

gtgcgggagg 

gaccaagtca 



aagaccttgc 

cttagccagt 

agtttgaatg 

caggtggcgt 

aatccctcta 

cctacttccc 

gcattccacc 

cctcctcctc 

gcagccgagt 

tactgtgtaa 

agggctgcaa 

tgaagaatga 

tcaagaagtg 

aacgagagaa 

agttgagcag 

tgggcccctc 

cacaacagct 

cccaggtggc 

cacctggcaa 

gctgggaaaa 

gtcataacga 

ctcctggccc 

ccacccacgt 

actcaaagaa 

gaacggtgca 

tggtagagtt 

ccctgcttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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ggctggcacc 
gacagtgatg 
gggagacctg 
cgaggaggag 
ggagaattcc 
ggtgctgaag 
ggacctgcgg 
ccagtgaccc 
cttctctctc 
taaaatattc 
cccccatcac 
tcccctggcc 
aaaaaaaaaa 
<210> 17 



tttgaggtgc 
ttcctaagcc 
ctcagtgcca 
ctgggcctct 
gcttcggtgg 
aaccggccct 
accctgaaca 
gcccggccgg 
ctttacgaga 
caagatgagc 
cgaacttccc 
ctctgatttg 
aaaag 



tgatggtgcg 
ggaccaccta 
tgttcgactt 
tcaccgcggt 
agcagctcca 
tggagacttc 
acatgcattc 
ccttctgccg 
cgaaaaggaa 
ctctggcccc 
ctcctcccct 
ttctgttcct 



ctttgcttcg 
cagcctgcag 
cagcgagaag 
ggtgcttgtc 
ggagacgctg 
ccgcttcacc 
cgagaagctg 
ctgccccctt 
aagcaaacca 
ctgagccttc 
atttaaacca 
gtctcaaatc 



ttgttcaacg 
gagcttggtg 
ctcaactccc 
tctgcagacc 
ctgcgggctc 
aagctgctgc 
ctgtccttcc 
gtacagaatc 
gaatcttatt 
ttgtaaatac 
ctctgtctcc 
caatagttca 



tgaaggacca 
ccatgggcat 
tggcgcttac 
gctcgggcat 
ttcgggctct 
tcaagctgcc 

gggtggacgc 

gaactctgca 
tatattgtta 
ctgcctccct 
cccacaaccc 
cagctaaaaa 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
235S 



<211> 4119 

<212> DNA 

<213> Homo sapiens 



<400> 17 
gaattccgtt 
acaccccaac 
atggccgagc 
tgcgacgggc 
accggcggcg 
ccgacagcgg 
gcggcagcgg 
gggtggagag 
ttgaaggtga 
gtgaatacag 
ttaattcttc 
ctggagagag 
gtaaaaaggg 
ctcggaaagg 
gacccaaggg 
tccgggaaga 
acattcgctc 
atgggagtcc 
gccaccaggg 
gtaccggccg 
gtgctggttt 
ttaagcatga 
ctccagaagc 
caccagctgc 
cctattcccg 
aggtgccccc 
ctccacctac 
agcaagatgt 
tcctgcaacc 
cacctcagtt 
catcccagcg 
tggagggaca 
gccctgcccc 
caggtttaca 
cagtgtccat 
ctgctccagg 
ctcccccacc 
gagtgaccta 
ggactcccca 
gttaatacaa 
gaactcagaa 
tgtggctcct 



gctgtcgcac 
acacacacac 
gccgcacgcg 
gcggtgcgta 
gcagcgcgct 
cggctccccg 
ctctctgcct 
cgggggcgcc 
tgctgttctc 
tgaagaggaa 
aacaaaagaa 
gcaaagtggg 
ccctaagcat 
gctcttcttt 
gcgtcagcga 
tgagcaggcc 
agctcataat 
tccacaaaga 
tcttgggggc 
tatgtctgca 
taggcctgtg 
gattagttac 
agatgctcca 
tgctcctgat 
ggcaagaaga 
tcctcctgaa 
taagactggg 
ggcacaacta 
acgggaactt 
taaccggatg 
gcaaagacct 
ttactatgat 
gctgcctcca 
tccccaccag 
gtctccagga 
cgtcatgaac 
accgcctcat 
ctataacccc 



gccagtcacc 
gtttctgaat 
gagaaataca 
accagcaaac 



acacacacac 
acacacacac 
tagcacgccg 
agtacctcgc 
tcgcaagaca 
ttgcggggag 
tcacagcgcg 
aagagtgctg 
tcggattatg 
aactccaaag 
gagaagggag 
gacggacagg 
ttggatgatg 
gagcatgatc 
aagctatgga 
ccaaagtccc 
cctgatgaca 
gatccaaact 
accctaccac 
cccaggaatt 
gaagctggtg 
cggtcacggc 
gtgcttggca 
gctgcaccac 
actcgaacca 
ggactgattc 
acctgggaag 
aatatagcag 
cgaggtatgc 
gaagaaatgg 
gtgccagagc 
ccactgcagt 
cagggcatgc 
acaccagctc 
cagccaccac 
tttggtaatc 
ctgtafcccta 
gcccagcagc 
atcaagcccc 
attttaaatc 
gctggctatc 
agctgaaaga 



acacacacac 
acacacacac 
ggactagcta 
cggtggtggc 
ccgaggacga 
gcgggagctg 
gaggccgaac 
aggagtcgga 
aaagtgcaga 
tggagctgaa 
aagaaaagcc 
agagcacaga 
atgaagatcg 
ttcgagggca 
aggatgaggg 
gacaggagct 
tcaaacctcg 
ggaacggtga 
caaggacatt 
attctcgatc 
ggcagcatgg 
gcctagagca 
gtcctgagaa 
caccccctga 
aagtxggaga 
cagcacctcc 
ctccggtgga 
aacagaattg 
ccaaccatat 
gtgtccaggg 
cccccgcccc 
tccagggacc 
ttgtgcagcc 
ctctgcccaa 
ctcagcagtt 
ccagttaccc 
atacacaggc 
aggtgcagcc 
ctccacctga 
ttaacatcat 
tactaccaga 
ggaggacccc 



acaccccaac 
acacacacac 
tccagcctcc 
cgttctccgt 
ggaatctggt 
cagcggtagc 
cggggccctt 
gtgtgagagt 
agactcggaa 
atcagaagct 
tgacaccaaa 
gcctgtggag 
gaagaatcca 
aactcaggag 
tcgctgggag 
cattgctctt 
aagaatccgg 
gcggctaaac 
tattaacagg 
tgggggcttc 
tggccggtct 
gacttctgtg 
ggaagaggca 
taggcccatt 
tgcagtcaag 
agtcccagaa 
ttctagtaca 
gagtccgggg 
acacatggga 
tggtcgagcc 
tccagtgcat 
aatctatacc 
aggaatgaac 
tccaggcctc 
gcttgctcct 
ttatgctcca 
cccatcacag 
aaagccctcc 
ggttgtaagc 
ataaaaagca 
agggcttcaa 
tgccttcctc 



acacacacac 



acacagcggg 
cagcagcctc 
aagatggcgg 
gcttcgggct 
gccggaggcg 
catctgcggc 
gaagatggca 
ggtgaagaag 
aatgatgctg 
agcactgtga 
aacaaagtgg 
gcatacatac 
gaggaagtca 
catgacaagt 
tatggttatg 
aaaccccgat 
aagtctcatc 
aatgctgcag 
aaggaaggtc 
ggtgagactg 
agggatccat 
gcctcagagc 
gagaagaaat 
cttgcagagg 
accaccccaa 
agtggacttg 
cagccttctt 
gcaggacctc 
aaacgctatt 
atcagtatca 
catggtgaca 
cttccccacc 
tatcccccac 
acttactttt 
ggggcactgc 
gtatatggag 
ccaccccgga 
aggggttcca 
gcagaggtga 
agatataggg 
tgaggacagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
126CA 
1320 W 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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ctctagagag 
ctgtgttcgg 
gcccatgtct 
agctccttcc 
cttaatatta 
aaatgaaaca 
cgtgcatccc 
ctggcctttc 
ctaaccctgt 
gtaggtgggt 
tacctgtact 
gatggcacac 

gggtgtgtat 

cttgcttctc 
gcatctcttt 
cttctccacg 
tctatttttg 
agtcaccttc 
tctgctgcct 
ctggccactg 
cccggtaatg 
acaggtgtgg 
actaagagat 
cttaaacctg 
agacactgtg 
tgtggtggtg 
tggagataat 
<210> 18 

<211> 2653 



agggagaaac 
gggagcagag 
tctgctgttc 
tgtttgtttt 
ttttaatttt 
agtctagtct 
agataatgga 
acttttagtt 
ggaagcatgg 
aggagccctt 
atgatgggct 
cagatgcttt 
tcacatagtc 
ttctttccat 
aaaggcaaat 
tctttcctgc 
tccttgcaga 
tgggcaaggg 
ctgtggaaga 
tccctgtcct 
tcactgtttt 
gatggagagt 
tcttctaggg 
aagatgtctc 
agagttgtct 
ttttttgtta 
aaaatttaga 



aagtggacct 

agagccagac 

ttcacttctg 

gttttctaag 

ttctctttgt 

tctggttttc 

gaatgtatca 

ggcatttgtt 

ctgtctgcac 

ctctttgact 

tctgttctct 

tgtgagaaag 

ctcagggctc 

tcttcttgct 

attatccagc 

tacaagtgtt 

cagaatataa 

ctcctatctt 

gattcctatt 

tctacagaac 

tattccttcc 

ggggagaggc 

gtagctggtg 

ctcaagcctg 

ctgttggtcc 

ctgttttaaa 

ctataaactt 



-23- 

cgtcccatct 

agccccaagc 

ggaaattgaa 

atgttcattt 

ttctgtttct 

tagcccctct 

gccagccttc 

atcctcttgt 

agagggtccc 

taggttttta 

gctgagggcc 

ggatggtgga 

agtcttttga 

acaccccttt 

aagcagtcta 

ttagatgtta 

aaactcctgg 

tcctccctat 

actgcagtac 

ctgagggcaa 

ate t age age 

acttaatctg 

gttgtgcctt 

tgggcagcat 

actgtgttta 

gggtgcccat 

gaaaaaaaa 



tcactcttca 

ttctgagtct 

gtgtcttctg 

ttaaagcctg 

tgctctctct 

ggattccctt 

cccaccaagt 

atacttgtat 

attgtgcaga 

ggagtctgag 

aataccctac 

gtgagagect 

ggtaagtgga 

tecagttget 

ccctgtcctt 

ctaccttatt 

gettaaggee 

ccatggcact 

ataegtctge 

agatggtggc 

tggectaate 

taacceceaa 

ttgtaggctg 

gcccagattc 

gttgcaagga 

ttgtgatcag 



cttgagttgg 

agatacagaa 

ttcccaagga 

gcttcttatc 

ccctgccttt 

ttgactcttc 

ctaaaaagac 

tcccttaact 

aaagctcaga 

catccatcaa 

tgtggggaga 

ttgectttag 

attagagggc 

gtggaccaat 

tgeaattget 

ttccccgaat 

taaggaagee 

aaaccacttc 

caggggtaac 

tgtgtctctc 

actctgagtc 

ggaggaaata 

ttccctttgc 

ccagacctta 

tttttccatg 

cattgtgact 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
•' 4119 



<212> DNA 

<213> Homo sapiens 



<400> 18 

gagegegget 

gtccgagagg 

gaggattget 

ttceggegag 

acaatcagtt 

gatgecaaac 

ctgcctctca 

cctccttgtt 

cttaagggac 

gaactageca 

acaacaaatt 

aagcaagaag 

cggctgcctg 

tgtgggaaaa 

tgettaagee 

ctgaattgea 

tgtcaggaag 

atgactgcag 

ageaaaggee 

ttggtgctga 

caagctagag 

atagecacta 

getgeagtte 

ctggatgttt 

etcaaaccac 

cttattcaca 

gagggagcac 

ttgatcaggc 



ggagtttgct 

ctgcgtgtga 

egaggaggee 

gectgagctg 

ttccaaaaag 

tagaaccaac 

gecceaggaa 

ctccaccaaa 

gaagattggt 

aagttcaeca 

ctgagcagag 

geacttgeta 

ccagggaaag 

aagctggaag 

ggattctgea 

tgtccttgag 

aggtatccag 

agaagggece 

aggatgtatt 

ttggtattgc 

aaaaatgtaa 

ttttgeaaga 

aattctgtgc 

gcaggagagc 

tgtctgaatg 

tatcccaagt 

aagattcctt 

agttgaaaat 



gctgccgctg 

gagaegtgag 

tggggtctgt 

tgctgtcgtc 

gaagctgtct 

aaatgtccaa 

aegtctggge 

gcaaggcaag 

atttgacaat 

aaacaaaata 

atgtccactg 

ccagcaagca 

ggagatggat 

cctttacctt 

agacctcaag 

gactgcccag 

gccagctggg 

catgattgtg 

gtacaegcta 

taataccctg 

gccacagctg 

tcgacttaat 

ccgcaaagtc 

tattgaaatt 

taaatcacct 

catctcagaa 

ccctcttcag 

caaagaggtc 



tgcagtttgt 

aaggatcctg 

gagacagegg 

atgcctcaaa 

egggcattga 

accgtaacct 

gatgacaacc 

aaagagaatg 

cagctgacaa 

ctttcttcag 

aagaaagaat 

aagctggtcc 

gtcatcagga 

tctggtgctc 

aaggaactga 

gctgtattcc 

aaggacatga 

ttggtattgg 

tttgaatggc 

gatctcacag 

ttgaacttcc 

caggtatcta 

tctgctgttt 

gtagagtcag 

tctgagcctc 

gttgatggta 

cagaagatct 

actctgggg a 



tcaggggctt 

eactgaggag 

agctgggtga 

cccgatccca 

acaaagctaa 

gttctcctcg 

tatgeaacac 

gtccccctca 

ttaagtctcc 

ttagaaaaag 

ctgcatgtgt 

tgaaeacagc 

atttcttgag 

ctggaactgg 

aaggctttaa 

cagctattgc 

tgaggaaatt 

acgagatgga 

catggctaag 

atagaattct 

caccttatac 

gagatcaggt 

caggagatgt 

atgtcaaaag 

tgattcccaa 

acaggatgac 

tggtttgctc 

agttatatga 



gtggcggtga 

gtggaaagaa 

aggctgeggg 

ggcacaggct 

aaactceagt 

tgtaaaagee 

tccccattta 

ctcacataca 

tagcaaaaga 

tcaagagatc 

gagactattc 

tgtcccagat 

ggaacaeatc 

aaaaactgee 

aactatcatg 

tcaggagatt 

ggaaaaacat 

tcaactggac 

caattctcac 

acctaggctt 

eagaaatcag 

tctggacaat 

tegcaaagea 

ccagactatt 

gagggttggt 

ettgagecaa 

tttgatgetc 

agectacagt 



\ 60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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-24- 

aaagtctgtc gcaaacagca ggtggcggct gtggaccagt cagagtgttt gtcactttca 1740 

gggctcttgg aagccagggg cattttagga ttaaagagaa acaaggaaac ccgtttgaca 1800 

aaggtgtttt tcaagattga agagaaagaa atagaacatg ctctgaaaga taaagcttfca 1860 

attggaaata tcttagctac tggattgcct taaattcttc tcttacaccc cacccgaaag 1920 

tattcagctg gcatttagag agctacagtc ttcattttag tgctttacac attcgggcct 1980 

gaaaacaaat atgacctttt ttacttgaag ccaatgaatt ttaatctata gattctttaa 2040 

tattagcaca gaataatatc tttgggtctt actattttta cccataaaag tgaccaggta 2100 

gacccttttt aattacattc actacttcta ccacttgtgt atctctagcc aatgtgcttg 2160 

caagtgtaca gatctgtgta gaggaatgtg tgtatattta cctcttcgtt tgctcaaaca 2220 

tgagtgggta tttttttgtt tgtttttttt gttgttgttg tttttgaggc gcgtctcacc 2280 

ctgttgccca ggctggagtg caatggcgcg ttctctgctc actacagcac ccgcttccca 2340 

ggttgaagtg attctcttgc ctcagcctcc cgagtagctg ggattacagg tgcccaccac 2400 

cgcgcccagc taatttttta atttttagta gagacagggt tttaccatgt tggccaggct 2460 

ggtcttgaac tcctgaccct caagtgatct gcccaccttg gcctccctaa gtgctgggat 2520 

tataggcgtg agccaccatg ctcagccatt aaggtatttt gttaagaact ttaagtttag 2580 

ggtaagaaga atgaaaatga tccagaaaaa tgcaagcaag tccacatgga gatttggagg 2640 

acactggtta aag 2653 
<210> 19 

<211> 2907 

<212> DNA ^ 
<213> Homo sapiens 



<400> 19 

gccatctggg 

ctgccagact 

tcctgcccga 

ttcttccccc 

cagcttccag 

agttctgaag 

ccttgctttg 

ggctgcaagg 

gacaagaact 

aagtgctttg 

aaggaggtgc 

ctcattgaga 

aaatacacta 

aagttcagtg 

cccggcttca 

gacatcctga 

tcggacgggc 

gacctggtct 

gggctgctca 

cgggtggaca 

aggcccagcc 

atcagcgcca 

ccgcctctca 

ccggggggtg 

agcctcagcc 

ccacatggac 

tgaccccgca 

ggggacgggg 

gcctgctccc 

tggtcctggg 

cagctcacca 

tcacaagcca 

agagggggtg 

ggtattaatt 

agaattttca 

tggatccaga 

tctcctttca 

ctctttaaaa 



cccaggcccc 

gtctgcctcc 

cacctggggg 

ctatgctggg 

ttagtggata 

agatagtgcc 

tctgtcagga 

gcttcttccg 

gcatcatcaa 

aagtgggcat 

ccaagcccga 

aggtgcgcaa 

cgaacaacag 

aactctccac 

ccaccctcac 

tcctgcggat 

tgaccctgaa 

ttgccttcgc 

gcgccatctg 

tgctgcagga 

gcccccacat 

agggggctga 

tccaggaaat 

gggggcggga 

ccagctccaa 

acagccctcg 

ccagccctgc 

agggaggagg 

acagcctggg 

tctcaggatg 

catcttcatc 

ttgctcccca 

ggacaggggc 

ctcgctggtt 

ttttaagcac 

gctggagggg 

tgtccctgtg 

ctgtgaagta 



atgccccgag 

cttctgactg 

cgggcacctc 

tggactctcc 

tagcacacca 

cagcccfcccc 

caagtcctca 

ccgcagcatc 

caaggtgacc 

gtccaaggag 

gtgctctgag 

agcgcaccag 

ctcagaacaa 

caagtgcatc 

catcgccgac 

ctgcacgcgg 

ccggacccag 

caaccagctg 

cctcatctgc 

gccgctgctg 

gttccccaag 

gcgggtgatc 

gttggagaac 

cgggggtggc 

cagaagcagc 

ccctccgccc 

ccccacctgc 

cagcgactcc 

ctgacgtcag 

ggtcctgggg 

accagcaaac 

gctggggaac 

ggggggttcc 

ttgtttttat 
atttatactg 
gtgggtccgg 
ccccccagtt 
ctaactttcc 



gaggggtggt 

tggccgcttg 

aatgggtacc 

ccgccaggcg 

tccccagcca 

tcgccacccc 

ggctaccact 

cagaagaaca 

cggaaccgct 

tctgtgagaa 

agctacacgc 

gaaaccttcc 

cgtgtctctc 

attaagactg 

cagatcaccc 

tacacgcccg 

atgcacaacg 

ctgcccctgg 

ggagaccgcc 

gaggcgctaa 

atgctaatga 

acgctgaaga 

tcagagggcc 

ctggcccccc 

ccggccaccc 

cggcttttct 

cctcccgggc 

ttggacagag 

aggccgaggc 

gcctcgtgtt 

gccaggactt 

ctcaacctcc 

ccctgtacat 

tttaattttt 

aaggaatttg 

gggagggagt 

ctcctcctca 

aaggcctgcc 



ctgaagccca 

gcatggccag 

cggtgcctcc 

ctctgaccac 

ccattgagac 

ctctaccccg 

atggggtcag 

tggtgtacac 

gccagtactg 

acgaccgaaa 

tgacgccgga 

ctgccctctg 

tggacattga 

tggagttcgc 

tcctcaaggc 

agcaggacac 

ctggcttcgg 

agatggatga 

aggacctgga 

aggtctacgt 

agattactga 

tggagatccc 

tggacactct 

cgccaggcag 

actccccgtg 

ctgcctttct 

agtactgggg 

gcctgggccc 

caggaactga 

catcaagaca 

ggctccccca 

cccctgcctc 

accctgccat 

ttgttttgat 

tgctgtgtat 

ggctcggaag 

gccttttcct 

ttcccctccc 



ccagagcccc 

caacagcagc 

ctacgccttc 

tctccagcac 

ccagagcagc 

catctacaag 

cgcctgtgag 

gtgtcaccgg 

ccgactgcag 

caagaagaag 

ggtgggggag 

ccagctgggc 

cctctgggac 

caagcagctg 

tgcctgcctg 

catgaccttc 

ccccctcacc 

tgcggagacg 

gcagccggac 

gcggaagcgg 

cctgcgaagc 

gggctccatg 

gagcggacag 

ctgtagcccc 

accgcccacg 

accgaccatg 

accttccctg 

tcagtggact 

gtgaggcccc 

cccctctgcc 

tcctcagaac 

ggttggtgac 

accaacccca 

ttttttaata 

tggggggagc 
gggcccccac 
cctcagtttt 
tcccactgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
96^ 
102M 
108<T 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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-25- 

gaagccgcca gcccctttct ccctctgcct gaccactggg tgtggacggt gtggggcagc 
™^ aa " a ca 99 ctcct 9 3=cttggcac ttgcctgcac ccallatglg Icltfgagfa 
?™r 9Ca a 999 cccc 99 gacagagttt tcccagacct ggctcctcgl lagaglSgcc 
«™ 9 ? 3Cccacatca tctaggctcc ccagccccca ctgtgaaggl gcfggccJgg 
9 cccc ^ accc <=cggcctcag ccaccagcac ccccatajgl cccfJagaS 
ccacacacat gcgcgtgcgc acacacacaa acacacacac actggacagt agatggqcca 
a ^ aCaC ^ Wccgagtt cctccatttc cctggcctgc cccIIaccL SacSgtcI 
cta^o^ 9 ccc ° ctcctt accccgcagg acgggcctac aggggggtct cccctclccc 
^ 9 "^ C ° C a 9ctggggga gctggctctg ccccgacctc cttcaccagg ggttggggcc 
gataaagS SSSSS lllttT^ ctttccactj Satl?altg 

<210> 20 

<211> 2096 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> miscJEeature 

<222> (23) . . (23) 

<223> n=a, c, g or t 

<220> 

<221> misc__feature 

<222> (27).. (27) 

<223> n=a, c, g or t 

<220> 

<221> misc_feature 

<222> (80).. (80) 

<223> n=a, c, g or t 

<220> 

< 2 2 1 > mis c_f eature 

<222> (120) . . (120) 

-<223> n=a, c, g or t 

<400> 20 

^of^** aaatacttt 9 atnctcngtt tccacctctc ttaaattgtc tttccctatg 
tSactata a . a ! «f ° n aa 9ttcgcaa tgagaacaac catctaaaan 

IV? t t° ta 99 tca 9gc gcggttgctc atgcctgtaa tcccaccact ttgggaggcc 
9 f tCaCC ^ ga 99tcaggatt ttgagaccag cttgaccaac at£tg?at 
Ctaaa ! atac aaaaaattag ccgggtgtgg tggcacaccc ctiEaltccc 
acctactcag gaggctgagg caggaaaatc ccttgaaccc aggaggcaaa aottacatto 
agccgaaata acaccactgc actccagcct ggacgataga gfgJgJcScc Sct?aaaa? 
aagagcagct gtgacaaatg cctgtattga attg C agg? c Ig?c??ccac ctccac^acc 
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26 



ggtgccaaaa 

gtctctcaaa 

tctgatgatt 

agtgttgatc 

gttcttaaaa 

tgtacatatt 

gccttagatc 

tgatacattt 

cttttctctt 

ctgtggctcc 

cagatgaaga 

gcccaagact 

tggggaaggt 

tttttataat 

tctgtgtttt 

gtagaaatag 

gctaaaacta 

gatatgagat 

atagaacttt 

ttttctttaa 

aagaaaaaca 

agtagttatg 

tctcccaaag 

atttgatcca 

tcattttggg 

ttatatgtgt 

ctattagcta 

<210> 21 



aaagggctgc 

agcctgatcc 

ctgactctaa 

ctagtcagtc 

atttggcata 

tagttttaac 

tgcttcttat 

ttttacatta 

ttagaaatcc 

tcgggcaaaa 

tgatctgttt 

ggttttaaag 

gtttttagta 

actgtataaa 

gagtctgctt 

ctatctgatc 

gagcagtttg 

agagcataat 

atggttctag 

gcaatgagaa 

ctttggctgt 

tgattatttc 

agagaaacca 

agatcttaaa 

aaatctccat 

gtgacttgag 

aattccaaca 



cccaaaagga 
tgccaaaacc 
ttttgagaaa 
cttttgctgt 
tctttaagaa 
aaagttaaat 
tacttggtag 
caattatgtt 
aagggggaga 
tctgtacggg 
taaaatgtga 
ttacctgaag 
caagacatca 
tagtgaccat 
cttttgtctt 
acttcagcgt 
atttaaaagt 
tatctgtttt 
tacagatact 
attgctcatg 
gtctataact 
agctcttgac 
atttctaaga 
tgttatattg 
aatttcaatt 
taatgttatc 
attttgtact 



actaaaaggg 
aagaatcgcc 
attgtttcga 
agatgttctg 
aattaactct 
atgccactta 
aaaatagaaa 
gtcagattta 
gtgatgactt 
caaagaaacc 
ggcgattatt 
ctcttaactt 
aagtgaagta 
ctcatgggca 
taaaacctga 
aaagcagtgt 
gtcactcttc 
atcttagttt 
ctactacact 
ttcttcatct 
tgacacagtc 
ctgtcccctc 
ggactggatt 
ataaccatgc 
tgtaaacttt 
aacgtttttg 
ttaataaaat 



atccagcttt 
gcaaaaggaa 
aagcagtcac 
aaacacgtaa 
catattctgt 
tttggccaat 
actccttgaa 
caatgtgcaa 
ccatatggac 
tataaagtac 
ttaagtaatt 
cctcccctct 
aagcccaagt 
ttgttttctt 
tttttaagtt 
gtttattaac 
ctccttttct 
tatacataat 
cagcctctta 
tctcaaatca 
aatagaatga 
tggctgcctc 
gcagaagact 
tcagcaatga 
gttaagacct 
taaatattta 
gttctaaaca 



gaattctggt 540 

gccatccact 600 

a age aagg t g 660 

etaagecatt 720 

tagcttttac 780 

ggaagagttg 840 

tatagtgtct 900 

gttacctggg 960 

tttgactcag 1020 

ctggaagagt 1080 

atcttaccaa 114 0 

gaatttagtt 1200 

gttctttagc 1260 

etctgetttg 132 0 

cttctgaact 1380 

catccactaa 1440 

actttcagta 1500 

ttaccatcag 1560 

tgtgccaagt 1620 

teagaggecg 1680 

agaaaattag 1740 

tgagtctgaa 1800 

eggggae aac 1 8 6 o{ 

gctattagat 1920 

gtctacattg 1980 

ctatgttttt 2040 

ttgaaa 2096 



<211> 2160 

<212> DNA 

<213> Homo sapiens 



<400> 21 

agccccctgc 

gcgcctcggc 

cccgttcgcc 

cggccaggcc 

ctgcgcccag 

ctcgtggccg 

cactgcccgc 

gagctggtgc 

ccctgcgggg 

gtggagaagc 

ategaggeca 

aacagcttca 

attcgagacc 

gcccggcctg 

gccgcttcac 

cgcaacggca 

tgctggtgtg 

gagctggact 

ggactcagcg 

gtctgagtct 

cacgtgtgcg 

gccagagcct 

tageccaaga 

tcattccttc 

tctagttttc 

actcctataa 

gagaagagac 

cctaaggaag 



ccctcgccgc 
ggecaggett 
cgccgggccg 
gtgccgccgt 
cgccccgcgc 
ccctgctgct 
cctgctccga 
gagaggeggg 
tgtacacccc 
r.cctgcacac 
tccaggaaag 
gcccctgtag 
ggagcaccag 
tgccccaggg 
agagccgcac 
acttccaccc 
tggaccggaa 
gccaccagct 
tcccctgcta 
gagtcctgtc 
tgtgcgtgcg 

gggg t 9 ttct 

ggtctgagee 
actcatccag 
agccttggga 
ggagagttca 
atgtaccttg 
cgtggggtag 



cccccgccgc 
gctcccctcc 
ccccgtctcc 
gtgccctccg 
ccgcgcccca 
ggccgccggg 
ggagaagctg 
ctgcggctgt 
ccgttgcggc 
actgatgeae 
cctgcagccc 
cgcccatgac 
tgggggcaag 
ctcctgccag 
ccacgaggac 
caagcagtgt 
gaegggggtg 
ggctgacagc 
ctcctgtgct 
tctgcctgcg 
tgtgtgtgtg 
ctttggtgtt 
ctggtgtgtt 
ccacctaaaa 
ggttttattc 
agcctgtggg 
accatcgtcc 
cagatggagt 



ctgcctgggc 

ggcacgcctg 

ccgcggcctc 

ccgctcgccc 

gtcctcgggc 

cccgggccga 

gcgcgctgcc 

tgcgccactt 

tcgggcctgc 

gggcaaggcg 

tctgacaagg 

cgcaggtgcc 

atgaaggtca 

agegagctge 

ctctacttca 

cacccagctc 

aagcttcegg 

tttcgagagt 

ctggaggctg 

geccagaagt 

tttgtgagca 

acacagccca 

tccagatcga 

acatttactg 

tgacttcctc 

agtagaaaaa 

ttcctctcaa 

aatggtcacg 



egggecgagg 

ctaacttccc 

egggtceggg 

gcgcgccgcg 

ggtccatget 

gectgggega 

gcccccccgt 

gcgccctggg 

gctgctaccc 

tgtgcatgga 

acgagggtga 

tgeagaagea 

atggggcgcc 

accgggcgct 

tccccatccc 

tggatgggca 

ggggcctgga 

gaggcctgcc 

cagagctgac 

ttccctcaaa 

tgggtgtgcc 

agaggactga 

tcctggattc 

accatgtact 

tgattttggc 

tctcattccc 

gctagcccag 

aggtccagac 



atgeggegea 

ccgctacgtc 

tcctccagga 

cgctccccgc 

gcccctctgc 

cgaagccatc 

gggctgegag 

cttggggatg 

gccccgaggg 

getggeggag 

ccaccccaac 

cttcgccaaa 

cegggaggat 

ggagcggctg 

caactgcgac 

gcgtggcaag 

gecaaagggg 

ageaggecag 

ccagagtgga 

tgcgcgtgtg 

cttggggtaa 

gactggcact 

actcactcac 

acgtgccagc 

atgtggagac 

agagtcagag 

agggtgggag 

ccactcccaa 



60 

120 

180 

240 

300 

360 

42CjA 

480^ 

540 

600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



BHC 03 1 051 



-27 



agctcagact 
gttgcaccat 
gtcaaaggaa 
cagggtggga 
gctaggggat 
gaagtcaccg 
tccatatctc 
tggcttgact 
<210> 22 



tgccaggctc 
ctgcttggtt 
gacttgaagc 
agaaagaatg 

gaggggcttc 

ggatgaacct 
cttcccctac 
ggatggaagg 



cctttctctt 
ggctggcagc 
acagagggct 
caagagtgga 
ctgggtcctg 
atccttccag 
acctccctcc 
agacttagga 



cttccccagg 
tgagagccct 
agggaggtgg 
ctgaatgtgc 
ttcccctacc 
tggctcgctc 
ccacacctcc 
acctaccagt 



tccttccttt 
gctgtgggag 
ggtacatttc 
ctaatggaga 
ccatttgtgg 
cctgtagctc 
ctactcccct 
tggccatgat 



aggtctggtt 
agcgaagggg 
tctgagcagt 
agacccacgt 
tcacagccat 
tgcctccctc 
gggcatcttc 
gtcttttctt 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



<211> 2215 

<212> DNA 

<213> Homo sapiens 



<400> 22 

ctgcagggag 

ctcaagaaaa 

agccacccag 

caagatgagg 

gagtctttgt 

tactccatca 

tatttcaaga 

atcctcttcc 

ggtgtccact 

ctcctacttc 

caccgccacc 

ctagccacag 

gagcaagcga 

gtggcccaga 

gtcatcatcc 

atcatcgctg 

gcccagacgg 

aacatcgcct 

ttgtacgcct 

ggctgcctca 

tccatgagtg 

ggactagagg 

caggacatcc 

ccctgctcca 

cagggctgat 

aaccaaaaac 

aggagggtga 

gcctcccaca 

accagagata 

acaaactctc 

tggcagctgc 

caggctctca 

gatcaaagcc 

ctgggatggg 

caaggccacg 

ctgctttcga 

aggaaacaac 

<210> 23 



ccatgattgc 

aaaaaaaaga 

ctaaactggc 

tcacggacga 

gctccaagaa 

tttgtttcgt 

ggctcaagac 

tcctgaccct 

tttgcaagct 

tttgcatcag 

gtgcccgcgt 

tgctctccat 

tgcgatgctc 

tggtgatcgg 

gcaccctgct 

tggfccgtggt 

tggccaactt 

acgacgtcac 

tcatcggcgt 

gccaggagca 

tggaggccga 

gacctctccc 

ccccgccaaa 

gaagttagct 

aagctaacac 

tgaaagtgaa 

gtgcaagggg 

gactcaaatg 

gtggggagac 

tgaacccctc 

cacgccgccc 

acaggggaga 

acactctggg 

aggagaggac 

agcttgttct 

ttcgttaaga 

agctttaaaa 



accactgcac 

aagaaaccac 

ctgttttccc 

ttacatcgga 

ggacgtgcgg 

gggcctactg 

catgaccgat 

tcccttctgg 

catctttgcc 

cattgaccgc 

ccttctcatc 

cccagagctc 

tctcatcaca 

ctttctggtc 

ccaggcacgc 

cttcatagtc 

caacatcacc 

ctacagcctg 

caagttccgc 

gctccggcag 

gaccaccacc 

agggtccctg 

agctgctcag 

tcaccccaat 

cagacagaca 

agtccagaaa 

cgtgggagtg 

ctcagaccag 

ttcttggctt 

cctccatcgt 

taaaagcaca 

gtgtggtgtt 

ctccagagtg 

aagggaaatg 

ttgttctttg 

gagcaacatt 

gaaaaaagaa 



tccagcctgg 

tgctctaggc 

tcatttcctt 

gacaacacca 

aactttaaag 

ggcaatgggc 

acctacctgc 

gcctacagcg 

atctacaaga 

tacgtggcca 

agcaagctgt 

ctgtacagtg 

gagcatgtgg 

cccctgctgg 

aactttgagc 

ttccagctgc 

agtagcacct 

gcctgcgtcc 

aacgatctct 

tggtcttcct 

accttctccc 

gggtggggat 

ggaaaagcag 

cccagctacc 

acactgggaa 

ctgttcccac 

gcctgaagag 

ctcttccgaa 

ggtgaggaaa 

tttcttcact 

ctcatcccct 

tcctgcaggc 

gggatgacat 

tcaggggcgg 

tcacagggac 

ttacccacac 

aaaaaaagct 



gcaacagagt 

taaatcccag 

ccccgaaggt 

cagtggacta 

cctggttcct 

tggtcgtgtt 

tcaacctggc 

cggccaagtc 

tgagcttctt 

tcgtccaggc 

cctgtgtggg 

acctccagag 

aggcctttat 

ccatgagctt 

gcaacaaggc 

cctacaatgg 

gtgagctcag 

gctgctgcgt 

tcaagctctt 

gtcggcacat 

cataggcgac 

agggagcaga 

ctctcccctc 

tcaaccaatg 

acagaggcta 

ctgctggagt 

tcctctgaat 

aaccaggcct 

agcggacatc 

gtcctccaag 

cacttgccgc 

caggccagct 

gcactcagct 

ggagggtgac 

tgaaaacctc 

acagataaag 

tggtaagtca 



gagaccatgt 

ccagagttgg 

atgcctgtgt 

cactttgttc 

ccctatcatg 

gacctatatc 

ggtggcagac 

ctgggtcttc 

cagtggcatg 

tgtctcagct 

catctggata 

gagcagcagt 

caccatccag 

ctgttacctt 

catcaaggtg 

ggtggtcctg 

taagcaactc 

caaccctttc 

caaggacctg 

ccggcgctcc 

tcttctgcct 

tgcaatgact 

agagtgcaag 

ccgaaaaaga 

ttgtccccta 

gaaggggcca 

gaaccttctg 

tatctccaag 

agctggtcaa 

ccagcgggaa 

gtcgccctcc 

gcctccgcgt 

cttggctcca 

agtggccgcc 

tcctcatgtt 

ttttcccttg 

agtag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
/ .900 
. 960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2215 



<211> 958 
<212> DNA 



<213> Homo sapiens 
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<400> 23 

ggggccggac 

ggcgcttttt 

ccaaaatgtc 

tgcccagcac 

acaggctggg 

ttccaaaacc 

aggtctggga 

ttattggtgg 

acgaagcaga 

accttgctta 

agagacaatc 

cagatgatta 

accaccgcct 

tcacaacagc 

gaaaactaga 

tcctggaaag 

<210> 24 



gcgaggggcg 
ctgcccgcgg 
aaaaagacca 
accagggttt 
agggaacccg 
cccaaagcca 
ccaagtaaag 
catgtggcga 
aaagatagag 
cataaatgca 
tcgcatggag 
tgatgatggc 
catcagtgaa 
tagaatgcag 
agctgaactt 
cacagattca 



gggcgagcgc 
tgtctcagat 
tcttatgccc 
gtgggataca 
agcaccaaca 
ccagataagc 
gcttccaacc 
gatctcactg 
tacaatgaat 
aaaagtcgtg 
aaaggagaac 
ttttcaatga 
attcttagtg 
gtcctcaaac 
cttcaaatag 
tttaacaatg 



gggacaaagg 
tcattcttaa 
cacctcccac 
atccatacag 
gccgggtcac 
cgctgatgcc 
ctgacctaaa 
atgaagaaaa 
ctatgaaggc 
cagaagctgc 
cgtacatgag 
agcatacagc 
agagtgtggt 
ggcaggtcca 
aggaacgaca 
aacttaaaag 



gaagcgaagc 
ggaactgaga 
cccagctcct 
tcatctcgcc 
ggcatcctct 
ctacatgagg 
gttgtgggag 
acaagaatat 
ctatcataat 
tttagaggaa 
cattcagcct 
caccgcccgt 
gccagacgtt 
gtccttaatg 
ccaggagaag 
gttgtgcggt 



cggagctgcg 
acttaatctt 
gcaacacaaa 
tacaacaact 
ggtatcacga 
tacagcagaa 
attggcaaga 
ttaaacgaat 
tcccccgcgt 
gaaagtcgac 
gctgaagatc 
ttccagagaa 
cggtcagttg 
gttcatcagc 
aagaggaaat 
ctgaaagt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
958 



<211> 6483 

<212> DNA 

<213> Homo sapiens 



<400> 24 

aagcttctaa 

caacttcaac 

actttgggag 

catgatgaaa 

cctgtagtcc 

agcttgcagt 

cgtctaaaaa 

cattcctatc 

acataattgt 

gggtagatat 

tataccaaat 

acctagcttg 

tctaatatag 

ttttcccttt 

tcatgctaag 

tatgatgaat 

tgttaaaaat 

ggcaaaatat 

tctcctaaat 

taaaatttta 

agcafcgatcc 

aatttgaaaa 

gcaaacattc 

tgttcttcaa 

atttcaacat 

gattggttat 

aagaaaacaa 

aggcttgtca 

ttcgatacag 

gaggatgtgg 

ttggtggagg 

gtgggggatg 

gtagctttca 

gagggggcaa 

gtggaggcgg 

ctggaaatga 

aagttcgggc 

aaaagcatgg 



ttgcagttca 

ttcctcttct 

gccaaggcag 

ccctgtctct 

cagctactcg 

gagccgagat 

aaaaaaaaga 

tcaacagctt 

aaaacatgta 

aatcttatac 

tctgtacaca 

ggaacacatg 

atttacattt 

ttactcaatc 

aaaataatgc 

ttaaaagtgc 

aatgccatca 

caatatactc 

tgtggtttta 

gaaactcttt 

tgccagsctt 

atcttatttt 

tatgcaacct 

caggtgaaaa 

agcatacctc 

tactgaagat 

tgtaagtaag 

ctgtccttgg 

ctcaagcaag 

aggaggagga 

atttagctca 

ctttgggggc 

tggaagctat 

tttcggaggt 

ctttggagga 

aaaagtaacc 

tctggaagaa 

caactcacat 



accacctgtt 
ataaattaga 
gcagatcaag 
accaaaaaga 
ggaggctgag 
ggcgccactg 
aagaaagaaa 
gttctagaaa 
tgaatatata 
aatgtaccaa 
gtatatccaa 
tgcacagtct 
ttgcaagcaa 
agaacttagt 
agattttctg 
aaaataatcc 
ttaattagaa 
tcatttgcca 
tggctttaat 
ctttttgatt 

aactataatt 
gaattagtgc 
gcataataaa 
gtgtttattc 
aatttatgca 
caaactctaa 
tagcaggcac 
cactactctt 
ggagtgtcat 

ggggggttca 

tcatcaggtg 
ggaagtagca 
ggcagctttg 
ggctttggtg 
atgcagaatc 
tcaaactatg 
cagggggagc 



acatatcttc 

aataacaata 

aggtgaggag 

caaaaaatta 

gcaggagaat 

cactccagcc 

gaaagaaaga 

tttttaaagc 

catccaaaca 

aatcccaatt 

gaaaatgtgt 

agttcataat 

atttttactt 

gtaaagtact 

cattattatg 

aggataactt 

aattctaaaa 

aataaattaa 

gttttatgtg 

ccctacagtt 

gttgctccaa 

ttgtagctga 

tgagaaaatt 

acatgctcat 

ttagggcaaa 

atcataagcc 

cacatgtgga 

tccctgggct 

cctcccgcag 

ccctaagaat 

gtggtggctc 

gctatggagg 

gctttggtgg 

Sftgggggcag 

gtggatttgg 

tgaatgaccg 

agctggaagg 

ctcgtgacta 



aggaaaaaat 
accacacctg 
attgagacca 
gccaggtatg 
ggcgtgaacc 

tgggcgacag 

aaagaaataa 

acagtatcac 

acagcaatgt 

tacttcacta 

tgtttttatt 

atttggtgca 

gcaatcgtaa 

acaagttagt 

gtcttcacag 

tatgatttca 

tcattacttc 

aagatctcct 

tggcaactat 

gtctacaaga 

ttaaaactgt 

aacttttttt 

ggatcttaat 

ctgaactcca 

ttcaaaattg 

aaagatgcta 

cacaccctct 

aaacagcatc 

tggaggagga 

ttctagcagc 

ttttagccgt 

attaggaggt 

gagttatgga 

ctttggtgga 

aggagatggt 

cctggcttcc 

caaaatcaag 

cagcaaatac 



cacaacctct 

taaccccagc 

tcctggctaa 

gtggcacaca 

cgggaggtgg 

agcaagcctc 

taataaccac 

aaacagcact 

catagcctat 

gacaaactgt 

gagaaactga 

agtatcattc 

catatccaaa 

tcttcggatt 

aaaccttaac 

cattttttaa 

cactttctta 

acaaacacaa 

tgatgctagt 

accttattgt 

ttaaaacacg 

tctaaacttt 

ggttgctcaa 

cccattttca 

tacatattag 

agttggcaaa 

cagtatataa 

accatgtctg 

ggaggaggag 

aaaggctccc 

gggagctctg 

tttggtggag 

ggcagctttg 

ggcggctttg 

ggccttctct 

tacttggaca 

gagtggtatg 

tacaaaacca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
96^ 
102^ 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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tcgatgacct 
atgtaatcca 
catgttactt 
tatggtcatg 
tactaagcac 
gctatttaca 
attcagactg 
tatgaaaatg 
gttcagagtg 
agccaaagca 
caggaataca 
tacaagactg 
tgcaatattt 
aagtgcagag 
taactgtgcc 
cagatcgaca 
ttgtataata 
caataacatt 
aaattttcca 
gtaactgcta 
aatattgcca 
acatctcccc 
tgaggtagct 
tgagctgacc 
ggcctatctg 
agagagttca 
gaaagacctt 
tgttgatctg 
aaaccgcaaa 
aaactcatgg 
ctagagcaag 
atctgagatt 
actggccttg 
gctcacctaa 
acaaagtgaa 
aaactgaaat 
taatgcctta 
aagccacaaa 
ggtcgctact 
ttgcaacaga 
attaagatcc 
aggtaaatta 
ggcttatcta 
gtacagaaaa 
ttcttaagag 
tggcagggtt 
agctacccac 
cggaggcggc 
cggcggaagc 
cggaggcgga 
cggcggccac 
cggtggtggc 
cagcagctcc 
ttccgggtcc 
taatctagaa 
gttttaaaaa 
taacaccagc 
ttaacaatgt 
atccctacag 
agtaatcaag 
tgaatcagaa 
acacagacca 
tgatggtcta 
gtatttcttc 
gttctcattt 
tcagaattta 



taaaaatcag 
gacagatgaa 
tcttccttca 
gcaatcttca 
ttttgaggtt 
aatgtagaaa 
ataacaagcc 
ggacactgct 
tatttattta 
tggctgaaat 
tctatataat 
gagaaattcc 
catatttaaa 
tgcatctatg 
tctccctcca 
atgccaggct 
ctatcacaac 
tacatttttc 
tgtcagtgtt 
aggttttttc 
ttacatgaga 
tatataagtt 
ctgcgccaga 
ctgaccaagg 
aagaagaacc 
ttatggtcct 
cgaaatgtgt 
actcaacttc 
gatgctgaag 
aggttttatc 
gaactgacta 
actgaattga 
gtatgttaac 
gaatatgcaa 
gctcatgtta 
ggtgccatta 
ctaaataatc 
aaaatccgtt 
gtgtgcagct 
ttcgagctga 
gactggagaa 
taacatgaaa 
aaccccaaga 
aggagtacgc 
ttctgtgaac 
ttattttttc 
cgttttccaa 
ggacgcggcg 
tccggcggcg 
agctccggcg 
ggcggcggaa 
agttccggcg 
ggcggcggat 
gtgggcgagt 
ttagttttaa 
attaaaaatc 
cgtgaaaaat 
aaagttgaaa 
actgtttaaa 
acaattattg 
accaagaaac 
ttatttacag 
tggaaataga 
ttttcacttt 
actaatgaat 
tctttactta 



gtaagaggta 
tcttaaatta 
aaaataaccc 
tccagcaagt 
ttaatgagca 
ctgaaaaaaa 
atattcagta 
tgcttctaag 
ttagtattta 
acatggaaat 
tccagttagt 
aagagaacaa 
ataaaaaatt 
tcactacaaa 
gattctcaac 
ggcagctgat 
gaatacatca 
tgaattcacc 
acctttttgg 
cattaaacca 
tcaactatgt 
ataaccagta 
gcgtggaggc 
ctgacctgga 
acgaggaggt 
cgtgtagcca 
ccactggtga 
tgaataacat 
cctggttcaa 
atttcagaat 
cagaaattga 
gacgtaatgt 
tctcatgaaa 
tgcaacaaaa 
atggaggaaa 
aacatacaac 
aaataagcgt 
tcagaaacaa 
ctcacagatt 
aaccgagtgc 
tgaaattcaa 
agttatccca 
agatgccaga 
aaccgcaatc 
ttaacattgt 
aacacgtttt 
cgttagagat 
gcggaagttt 
gctacggcgg 
gcggaagctc 
gctccagcgg 
gcggcggcgg 
acggcggcgg 
cttcatctaa 
cttcctgtga 
ttagagacgg 
ggcatgatca 
tttccttctt 
attttttttc 
aagaggtggc 
actactatta 
atgcatggaa 
ccttgaaaat 
accacaaagt 
ttcaataaac 
agtacataat 



tttttaaatc 
agcacaatgt 
agtctcatca 
gtgctacttg 
tacaatgagt 
aaatcatgat 
ccaacatggc 
gaatttctga 
tcatgagtta 
cacatagtct 
tttcagaaag 
ataattcaaa 
ttcccaagat 
agccatatct 
ctaacaactg 
gacttcaggc 
gtggttttta 
caaagttaaa 
caatattaaa 
ctattacttc 
agttgctttt 
ttgatatcat 
tgacatcaac 
gatgcaaatt 
gacacaaaag 
ataaatcttt 
tgtgaatgtg 
gagaagccaa 
tgaaaaggta 
ttcctcaccc 
taataacatt 
acaagctctg 
tgacttcaac 
aaatgcagag 
agtactacta 
aaattctgtt 
caccatactc 
tccctggaag 
cacgcccaga 
cagaatactg 
acctaccgca 
gtttctttta 
gaatgagagg 
tcactttctc 
tttcaaagga 
gcttattttc 
aaccgaatgt 
cggcggcggc 
cggccacggc 

cggcggcggc 

cggccacggc 
cggctacggg 
cagctccagc 
gggaccaagg 
tggttttttt 
ttccgtttgc 
aaatgtcata 
tttacaatat 
catcattcac 
gcccgacggt 
aactgcatca 
aacaaagtct 
aaggtgtcta 
gttctttaat 
tttcttactg 
actctttaaa 



cagctttaag 
ggctgttcac 
aagataaaca 
tcttaagagg 
ccacagttaa 
atgaatcaga 
aagaaaaata 
attgtaccta 
aacaaatgca 
aaaagaggag 
gaataattcg 
gttaagtata 
tttgtaagag 
gcatctgacc 
ataatgccaa 
tgaagtaagt 
acaatgactt 
tagtattgga 
ggaagaaaat 
taagagaact 
taaatagtct 
gcttgtttca 
ggcctgcgta 
gagagcctga 
ttatactttt 
ctgttcctca 
gaaatgaatg 
tatgaacaac 
aagtaatctt 
ttttccttgt 
gaacagatat 
gagatagaac 
tttatcatac 
ttggaggtaa 
gtgttgatct 
cattttctta 
aactgaacaa 
cctccttggc 
tatccgctct 
aataccaaca 
gcctgctaga 
ttcaatattc 
aagggaggag 
atgaatttgg 
tgggttttaa 
taaattaacg 
gacctcaccc 
tacggcggcg 
ggcagttccg 
tacgggggcg 
ggcagttcca 

ggcggcagct 

ggaggccaca 
tcagcagaaa 
gcgctttaag 
atttgttcac 
ccttaagcat 
ttgcttgtta 
acagatacta 
agagttcttt 
agaggaaaga 
ccaagaaaac 
caaggtgttt 
ggaaagaaaa 
atgcaaacta 
attaaagatt 



tatcttgtcc 
tatgcttacc 
tctgtgaaac 
atgggagatt 
aatatgctag 
acaaaatgtt 
aattttccag 
ttgtgtacca 
ggtgtgagtc 
ggcacactta 
tgtacagaaa 
tgggtaagcc 
aacaacataa 
tcttctcaaa 
catcctgctt 
taagtgatcg 
gggatgccct 
gttatctgag 
gcatattaaa 
gtacatgaca 
ctgcccagat 
ggtatgagaa 
gggtgctgga 
ctgaagagct 
cccagccaaa 
aacaggaaat 
ctgccccggg 
ttgctgaaca 
ccttatagtg 
ttttaatact 
ccagctataa 
tacagtccca 
aaagtttcat 
gaaagagaaa 
aaaagtgctg 
ttcttctata 
ggaagtcact 
agaaacagaa 
ggaagaacag 
actcctggat 
aggagaggga 
cagatagcaa 
agagggtaga 
cccaaaatga 
aatatatacc 
gcaactggaa 
cgtttagttc 
gaagctccgg 
gcggcggcta 
gaagctccag 
gcggcggcta 
ccggcggcgg 
agtcctcctc 
ctagctgggg 
ctctagagtt 
aaacfcactct 
ttttttgggc 
attactaagg 
acaaaaccag 
catctatggt 
gtctcccttc 
acttctgtct 
tgtggtttct 
acaactttgt 
tcccaatttg 
agtaacccat 



2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 
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agcagttgaa ggttgatgta tccagaaatt cggaagacag aactattgtc atgccttttc 6300 

taagtttttt aatcatgtat gttcagacca ccgtcagtaa attcactgag taaagtctgt 6360 

aaatccccaa tattactctt taagatacac aatatgtgga aggctcccag ctctctggct 6420 

ttaaattatt tcaatcctgg aaattctgga atatctcaaa tataaccccc aaaataataa 6480 

taa 6483 
<210> 25 

<211> 1871 

<212> DNA 

<213> Homo sapiens 



<400> 25 

agttgtggcc 

cgcacccccg 

atgtctgctt 

tgtgggggag 

tccggaagtt 

agctttggag 

ctctcgggca 

aatgatagat 

ctagaaaata 

cagagcgatt 

gccagcattg 

gacttcagga 

aatggcctgc 

atcgagagcc 

agcttccggg 

gacctcacca 

cggaaggacg 

accaacaccg 

cagaacctgg 

ttggccgaag 

aacctggagg 

cagcggctgc 

ctggacgggg 

tcacaagcac 

gttgtgcagg 

ctaatgtaaa 

aagtagaaat 

gttccatttt 

ttcagtcaca 

ccttctggac 

cttaatcata 

t gaa t gg t ci 

<210>""26 



accttcccca 
gactgtcccg 
ccagtgttgg 
gcttttctgc 
ccatggcagg 
ggctggggat 
atgatggagg 
tagcttccta 
aaattcgaga 
acagcaaata 
gaaatgccca 
tgaagtatga 
gccgggtgct 
tgaacgagga 
tgggcggccc 
ggctcctcaa 
ctgaagcctg 
agcagcttca 
agatcgagct 
ccgagggcga 
cacagctgct 
tgaatgtcaa 
aggcccaagg 
agtcaactga 
agatggtgaa 
atttcacaag 
tattcctttc 
ctttggattc 
agtcattttg 
aactggagta 
gagctttact 



ggccatggat 
gcggctctcc 
aagtggttat 
tgcttccatg 
aggactgggt 
gggatttggg 
ccttct-ttct 
cctggataag 
atggtatgaa 
ttatccactg 
gctcctcttg 
gaatgaactg 
ggacgagctg 
gctggcctac 
aggcgaggtc 
tgatatgcgg 
gttcattgaa 
gtccagcaag 
acagtcccag 
ttactgcgcg 
ccaggtgcgc 
ggcccgcctg 
tgatggtttg 
ttcctctaaa 
tggtgaggtg 
atctgcccca 
agagtaacat 
cctattcaca 
gtcatgttgg 
gtcttttaac 
caagtagtat 



ctctccaaca 
tcgcagagtg 

gggggaagtg 

tttggttcta 
gctggttatg 
ggcagcccag 
ggatcagaaa 
gtgcgagctc 
acacgaggaa 
attgaagacc 
cagattgaca 
gccctgcgcc 
accctgacca 
atgaagaaga 
agcgtagaaa 
gcgcagtatg 
aagagcgggg 
agcgaggtca 
ctcgccatga 
cagctgtccc 
gcggacgcag 
gagctggaga 
gaggaaagtt 
gacccaacca 
gtctcatctc 
tgattggttc 
gctgtattac 
ttgaatcctt 
tctttgtaac 
gaactttctt 
tgttttaata 



acaccatgtc 
tgataggcag 
cctttggctt 
gttccggctt 
ggagagccct 
gaggtggctc 
aagaaactat 
tagaagaggc 
ctgggactgc 
tcaggaataa 
atgcgagact 
agggcgtaga 
ggaccgacct 
accacgagga 
tggacgctgc 
aaaccatcgc 
agctccgtaa 
ccgacctgcg 
agaaatccct 
aggtgcagca 
agcgccagaa 
ttgagaccta 
tatttgtgac 
aaacccgaaa 
aagttcagga 
cttaggaaca 
ttcaatccct 
tttgcccttc 
aaatcaaaat 
ctggtaaccc 
gagttaattg 



actctcagtg 
acccaggggc 
tggagccagc 
tgggggtggc 

gggtggaggt 
tctaggtatt 
gcaaaatctt 
taatactgag 
agatgcttca 
gatcatttca 
agctgctgag 
ggccgacatc 
ggagatgcag 
tgagctccaa 
ccccggagtg 
tgagcagaat 
ggagattagc 
tcgcgccttt 
ggaggactcc 
gctcatcagc 
cgtggaccac 
ccgccgcctg 
agactccaaa 
aatcaagaca 
aattgaagaa 
agaaatttac 
atttttgtct 
tgaaacaata 
taccttatat 
ggaatatttt 
taataaaaga 



60 
120 
180 
240 
300 
36Qj 
420l 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
168<fc 
174^ 
1800 
I860 
1871 



<211> 1447 

<212> DNA 

<213> Homo sapiens 



<400> 26 

ctgcaactgg ttctgcgagg gctccttcaa tggcagcgag aaggagacta tgcagttcct 60 

gaacgaccgc ctggccagct acctggagaa ggtgcgtcac gtggagcggg acaacgcgga 12 0 

gctggagaac ctcatccggg agcggtctca gcagcaggag cccttgctgt gccccagcta 180 

ccagtcctac ttcaagacca ttgaggagct ccagcagaag atcctgtgca gcaagtctga 240 

gaatgccagg ctggtggtgc agatcgacaa tgccaagctg gctgcagatg acttcagaac 300 

caagtaccag acggagcagt ccctgcggca gctggtggag tccgacatca acagcctgcg 360 

caggattctg gatgagctga ccctgtgcag gtctgacctg gaggcccaga tggagtccct 420 
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gaaggaggag 

ccagcttgga 

ggtcctgaac 

ggagcaatgg 

gcagctgcag 

gatcgagctg 

cgaggcccgc 

ccagctggcg 

ggacgtgcgg 

ggactgcaag 

atcctgtgtc 

gtactagata 

ccagttctgc 

atcatggtct 

actgatcctc 

gttggagctg 

tgcagttccc 

aaaaaaa 

<210> 27 



ctgctgtccc 
gaccgcctca 
gagaccagga 
ttcgccacgc 
tcctaccagg 
caggcccagc 
tacagctccc 
gagatccgca 
gcgcggctgg 
ctgccctcca 
accaatcctt 
ccctggggcc 
ctctctgaca 
ggaaggagaa 
ctcaccccag 
tgacagtggc 
caaatcgcct 



tcaagcagaa 
acgtggaggt 
atcagtatga 
agaccgagga 
cggagatcat 
acaacctgcg 
agctgtccca 
gtgacctgga 
agtgtgagat 
acccctgcgc 
gtggtcctcg 
agcagaagta 
accatcagcc 
caaatgccca 
gtccctctcc 
atacgaggtg 
aataaacttt 



ccatgagcag 
ggacgctgct 
ggccctggtg 
gctgaacaag 
cgagctgaga 
atactctctg 
ggtgcagagc 
gcggcagaac 
caacacatac 
caccaccaat 
ttcccgctgt 
tagcatgaag 
accggacccc 
gcgtttgggt 
tgtagtcagt 
ttttgttctc 
cctcttgcaa 



gaagtcaaca 
cccgctgtgg 
gaaaccaacc 
caggtggtat 
cgcacagtca 
gaaaacacgc 
ctgatcacca 
caggagtatc 
cggagcctgc 
gcatgtgaaa 
gggccttgca 
acagaactac 
accccgaggc 
ctgactctga 
ctgagttctg 
tctgctgctt 
agcagacaaa 



ccttgcgctg 
acctgaacca 
gcagggaagt 
ccagctcgga 
atgccctgga 
tgacagagag 
acgtggagtc 
aggtgctgct 
tggagagcga 
agcccattgg 
acacctttgg 
catcggtggg 
atcaccacaa 
gcctagggct 
atggtcagag 
ctacctttat 
aaaaaaaaaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1447 



<211> 261 
<212> PRT 
<213> Homo sapiens 



<400> 27 



Met 


Asn Pro Asn 


Cys 


Ala 


Arg Cys 


1 




5 




Lys 


Val Asn Cys 


Leu 


Asp 


Lys Phe 




20 








Glu 


Thr Cys Lys 


Met 


Thr 


Leu Asn 




35 






40 


Lys 


Lys Pro Tyr 


Cys 


Asn 


Ala His 




50 






55 


Val 


Ala Asp Thr 


Pro 


Glu 


Asn Leu 


65 






70 




Gin 


Ser Gin Val 


Arg 


Tyr 


Lys Glu 






85 






Gly 


Phe Ser Val 


Val 


Ala 


Asp Thr 




100 






Thr 


Gin Asp Gin 


lie 


Ser 


Asn He 




115 






120 


Ser 


Arg Met Gly 


Pro 


Ser 


Gly Gly 




13 0 






135 


Asp 


Ser Gin Asp 


Gly 


Ser 


Ser Tyr 


145 






150 


Pro 


His His lie 


Pro 


Thr 


Ser Ala 






165 






Gin 


Pro Val Ala 


Gin 


Ser 


Tyr Gly 




180 






Val 


Ser lie Gin 


Arg 


Ser 


Ala Pro 




195 






200 


Ala 


Val Tyr Asp 


Tyr 


Ser 


Ala Ala 




210 






215 


Asp 


Gly Asp Thr 


He 


Val 


Asn Val 


225 






230 




Tyr 


Gly Thr Val 


Glu 


Arg 


Thr Gly 






245 




Tyr 


Val Glu Ala 


He 







260 



<210> 28 



Gly 


Lys 


He 


Val Tyr 


Pro Thr Glu 




10 






15 


Trp 


His 


Lys 


Ala Cys 


Phe His Cys 


25 








30 


Met 


Lys 


Asn 


Tyr Lys 


Gly Tyr Glu 








45 




Tyr 


Pro 


Lys 


Gin Ser 


Phe Thr Met 








60 




Arg 


Leu 


Lys 


Gin Gin 


Ser Glu Leu 






75 




80 


Glu 


Phe 


Glu 


Lys Asn 


Lys Gly Lys 




90 






95 


Pro 


Glu 


Leu 


Gin Arg 


He Lys Lys 


105 








110 


Lys 


Tyr 


His 


Glu Glu 


Phe Glu . Lys 








125 


Glu 


Gly 


Met 


Glu Pro Glu Arg Arg 








140 




Arg 


Arg 


Pro 


Leu Glu 


Gin Gin Gin 






155 




160 


Pro 


Val 


Tyr 


Gin Gin 


Pro Gin Gin 




170 






175 


Gly 


Tyr 


Lys 


Glu Pro 


Ala Ala Pro 


185 








190 


Gly 


Gly 


Gly Gly Lys Arg Tyr Arg 








205 




Asp 


Glu 


Asp 


Glu Val 


Ser Phe Gin 








220 




Gin 


Gin 


He 


Asp Asp 


Gly Trp Met 






235 




240 


Asp 


Thr 


Gly Met Leu. Pro Ala Asn 




250 






255 



<211> 478 



BHCQ3 1 051 



<212> PRT 

<213> Homo sapiens 



<400> 28 










Met Val 


Gin 


Lys 


Thr 


Ser Met 


Ser 


1 






5 






Glu lie 


Pro 


Met 


Glu 


Val Phe 


Asp 






20 








Lys Arg 


Lys 


Gly 


Arg 


Phe Lys 


Arg 




35 








40 


Thr Thr 


Ser 


Asn 


Ser 


Phe Val 


Arg 


50 








55 




ser Arg 


Pro 


Ser 


Asp 


Ser Asp Val 


65 








70 




Leu Arg 


Lys 


Glu 


Ala 


Glu Arg Gin 








85 






Lys Thr Lys 


Pro 


Val 


Ala Phe 


Ala 






100 








Pro Ser 


Pro 


Gly 


Asp 


Glu Val 


Pro 




115 








120 


Glu Pro 


Lys 


Asp 


Phe 


Leu His 


He 


130 








135 




Trp lie 


Gly 


Arg 


Leu 


Val Lys 


Glu 


145 








150 




Ser Pro 


Val 


Lys 


Leu 


Asp Ser 


Leu 








165 






Arg Gin 


Asn 


Arg 


Leu 


Gly Ser 


Ser 






180 








Ser Leu Gly 


Asp 


Val 


Val Thr Gly 




195 








200 


Ser Ala 


Lys 


Gin 


Lys 


Gin Lys 


Ser 


210 








215 




Val Val 


Pro 


Ser 


Met 


Arg Pro 


He 


225 








230 




Gly Tyr Glu 


Val 


Thr Asp Met 


Met 








245 






Lys His Arg 


Phe 


Asp Gly Arg 


He 






260 








He Ser 


Leu 


Ala 


Lys 


Arg Ser 


Val 




275 








280 


He He 


Glu 


Arg 


Ser 


Asn Thr 


Arg 


290 








295 




Glu He 


Glu 


Arg 


He 


Phe Glu 


Leu 


305 

T * 


"A "1 — 




ml, — 


310 

TT — "A 


XT 4 *<• 


XJC U 


rxxa 




XiiX 

325 


.l -l c neu 


X&JL0 


Leu Ala 


Pro 


He 


He 


Val Tyr 


He 






340 








Gin Arg Leu 


He 


Lys 


Ser Arg Gly 




355 








360 


Val Gin 


He 


Ala 


Ala 


Ser Glu 


Lys 


370 








375 




Phe Asp 


He 


He 


Leu 


Asp Glu 


Asn 


385 








390 




Leu Ala 


Glu 


Tyr 


Leu 


Glu Ala 


Tyr 








405 






Ser Thr 


Pro 


Pro 


Asn 


Pro Leu 


Leu 






420 








Leu Arg 


Arg 


Ser 


Pro 


Ala Pro 


Val 




435 








440 


Thr Ser 


Leu 


Arg 


Arg Asn Leu Gly 


450 








455 





-32- 



Arg 


Gly 


Pro 


Tyr 


Pro 


Pro 


Ser 


Gin 




10 










15 




Pro 


Ser 


Pro 


Gin 


Gly 


Lys 


Tyr 


Ser 


25 










30 






Ser Asp 


Gly 


Ser 


Thr 


Ser 


Ser 


Asp 










45 








Gin Gly 


Ser 


Ala 


Glu 


Ser 


Tyr 


Thr 








60 










Ser 


Leu 


Giu 


Glu 


Asp 


Arg 


Glu 


Ala 






75 










80 


Ala 


Leu 


Ala 


Gin 


Leu 


Glu 


Lys 


Ala 




90 










95 




Val 


Arg 


Thr 


Asn 


Val 


Gly 


Tyr 


Asn 


105 










110 






Val 


Gin 


Gly 


Val 


Ala 


He 


Thr 


Phe 










125 








Lys 


Glu 


Lys 


Tyr 


Asn 


Asn 


Asp 


Trp 








140 










Gly 


Cys 


Glu 


Val 


Gly 


Phe 


He 


Pro 






155 










160 


Arg 


Leu 


Leu 


Gin 


Glu 


Gin 


Lys 


Leu 




170 










175 




Lys 


Ser 


Gly 


Asp 


Asn 


Ser 


Ser 


Ser 


185 










190 






Thr 


Arg 


Arg 


Pro 


Thr 


Pro 


Pro 


Ala 










205 








Thr 


Glu 


His 


Val 


Pro 


Pro 


Tyr 


Asp 








220 










He 


Leu 


Val 


Gly 


Pro 


Ser 


Leu 


Lys 






235 










240 


Gin 


Lys 


Ala 


Leu 


Phe 


Asp 


Phe 


Leu 




250 










255 




Ser 


He 


Thr 


Arg 


Val 


Thr 


Ala 


Asp 


265 










270 






Leu 


Asn 


Asn 


Pro 


Ser 


Lys 


His 


He 










285 








Ser 


Ser 


Leu 


Ala 


Glu 


Val 


Gin 


Ser 








300 










Ala 


Arg 


Thr 


Leu 


Gin 


Leu 


Val 


Ala 






315 










320 


Pro 






T 


Ssr 


TlFM 

J-» jr «J 


TTU ~- 


Ssr 




330 










335 




Lys 


He 


Thr 


Ser 


Pro 


Lys 


Val 


Leu 


345 










350 






Lys 


Ser 


Gin 


Ser 


Lys 


His 


Leu 


Asn 










365 








Leu 


Ala 


Gin 


Cys 


Pro 


Pro 


Glu 


Met 








380 










Gin 


Leu 


Glu 


Asp 


Ala 


Cys 


Glu 


His 






395 










400 


Trp 


Lys 


Ala 


Thr 


His 


Pro 


Pro 


Ser 




410 










415 




Asn 


Arg 


Thr 


Met 


Ala 


Thr 


Ala 


Ala 


425 










430 






Ser 


Asn 


Leu 


Gin 


Val 


Gin 


Val 


Leu 










445 








Phe 


Trp 


Gly 


Gly 


Leu 


Glu 


Ser 


Ser 



460 



BHC03 1 051 



-33- 

Gln Arg Gly Ser Val Val Pro Gin Glu Gin Glu His Ala Met 

470 475 



465 




<210> 


29 


<211> 


196 


<212> 


PRT 


<213> 


Homo 


<400> 


29 



Met Ser Met Leu Arg Leu Gin Lys Arg Leu Ala Ser Ser Val Leu Arg 

1 5 io 15 

Cys Gly Lys Lys Lys Val Trp Leu Asp Pro Asn Glu Thr Asn Glu He 

20 25 30 

Ala Asn Ala Asn Ser Arg Gin Gin He Arg Lys Leu He Lys Asp Glv 

35 40 45 

Leu He He Arg Lys Pro Val Thr Val His Ser Arg Ala Arg Cys Ara 

50 55 60 

Lys Asn Thr Leu Ala Arg Arg Lys Gly Arg His Met Gly He Gly Lys 
65 70 75 80 

Arg Lys Gly Thr Ala Asn Ala Arg Met Pro Glu Lys Val Thr Trp Met 

85 90 95 

Arg Arg Met Arg He Leu Arg Arg Leu Leu Arg Arg Tyr Arg Glu Ser 

1Q 0 105 no 

Lys Lys He Asp Arg His Met Tyr His Ser Leu Tyr Leu Lys Val Lvs 

US 120 125 

Gly Asn Val Phe Lys Asn Lys Arg He Leu Met Glu His He His Lvs 

!30 135 140 

Leu Lys Ala Asp Lys Ala Arg Lys Lys Leu Leu Ala Asp Gin Ala Glu 
" 5 150 155 leo 

Ala Arg Arg Ser Lys Thr Lys Glu Ala Arg Lys Arg Arg Glu Glu Arg 

165 170 ^ 175 

Leu Gin Ala Lys Lys Glu Glu He He Lys Thr Leu Ser Lys Glu Glu 

"0 185 190 

Glu Thr Lys Lys 

195 
<210> 30 

<211> 1566 

<212> PRT 

<213> Homo sapiens 



<400> 


30 




Met 


Ser 


Ser 


Leu 


1 








Pro 


Trp 


Ser 


Glu 








20 


Val 


Val 


Met 


Ser 






35 




Thr 


Leu 


Gin 


Lys 




50 






Arg 


Leu 


Glu 


Ser 


65 








Ala 


Ser 


Gly 


Thr 


Val 


Gin 


Leu 


Asp 








100 


His 


Gly 


Glu 


Asn 






115 





5 10 is 

Thr He Lys Leu Val Arg Gin Val Met Glu Lys Ar< 

25 30 
Ser Gly Gly His Gin His Leu Val Ser Cys Leu Gli 

40 45 
Ala Leu Lys Val Thr Ser Leu Pro Ala Met Thr Asi 

55 60 
He Ala Gly Gin Asn Gly Leu Gly Ser His Leu Se: 
70 75 80 

Glu Cys Tyr He Thr Ser Asp Met Phe Tyr Val Gl\ 
85 90 95 



105 

Cys Pro Glu Leu Val Gin 
120 125 



110 
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Glu Lys Asn Ser Asp Glu Phe Ser Lys His Leu Lys Gly Leu Val Asn 

130 135 140 

Leu Tyr Asn Leu Pro Gly Asp Asn Lys Leu Lys Thr Lys Met Tyr Leu 
145 150 155 160 

Ala Leu Gin Ser Leu Glu Gin Asp Leu Ser Lys Met Ala lie Met Tyr 

165 170 175 

Trp Lys Ala Thr Asn Ala Gly Pro Leu Asp Lys He Leu His Gly Ser 

180 185 190 

Val Gly Tyr Leu Thr Pro Arg Ser Gly Gly His Leu Met Asn Leu Lys 

195 200 205 

Tyr Tyr Val Ser Pro Ser Asp Leu Leu Asp Asp Lys Thr Ala Ser Pro 

210 215 220 

He He Leu His Glu Asn Asn Val Ser Arg Ser Leu Gly Met Asn Ala 
225 230 235 240 

Ser Val Thr lie Glu Gly Thr Ser Ala Val Tyr Lys Leu Pro He Ala 

245 250 255 

Pro Leu He Met Gly Ser His Pro Val Asp Asn Lys Trp Thr Pro Ser 

260 265 270 

Phe Ser Ser He Thr Ser Ala Asn Ser Val Asp Leu Pro Ala Cys Phe 

275 280 285 

Phe Leu Lys Phe Pro Gin Pro He Pro Val Ser Arg Ala Phe Val Gin 

290 295 300 

Lys Leu Gin Asn Cys Thr Gly He Pro Leu Phe Glu Thr Gin Pro Thr 
305 310 315 320 

Tyr Ala Pro Leu Tyr Glu Leu He Thr Gin Phe Glu Leu Ser Lys Asp 

325 330 335 

Pro Asp Pro He Pro Leu Asn His Asn Met Arg Phe Tyr Ala Ala Leu 

340 345 350 

Pro Gly Gin Gin His Cys Tyr Phe Leu Asn Lys Asp Ala Pro Leu Pro 

355 360 365 

Asp Gly Arg Ser Leu Gin Gly Thr Leu Val Ser Lys He Thr Phe Gin 

370 375 380 

His Pro Gly Arg Val Pro Leu He Leu Asn Leu He Arg His Gin Val 
385 ~ 390 395 400 

Ala Tyr Asn Thr Leu He Gly Ser Cys Val Lys Arg Thr He Leu Lys 

405 410 415 

Glu Asp Ser Pro Gly Leu Leu Gin Phe Glu Val Cys Pro Leu Ser Glu 

420 425 430 

Ser Arg Phe Ser Val Ser Phe Gin His Pro Val Asn Asp Ser Leu Val 

435 440 445 

Cys Val Val Met Asp Val Gin Gly Leu Thr His Val Ser Cys Lys Leu 

450 455 460 

Tyr Lys Gly Leu Ser Asp Ala Leu He Cys Thr Asp Asp Phe He Ala 
465 470 475 480 

Lys Val Val Gin Arg Cys Met Ser He Pro Val Thr Met Arg Ala He 

485 490 495 

Arg Arg Lys Ala Glu Thr He Gin Ala Asp Thr Pro Ala Leu Ser Leu 

500 505 510 

He Ala Glu Thr Val Glu Asp Met Val Lys Lys Asn Leu Pro Pro Ala 

515 520 525 

Ser Ser Pro Gly Tyr Gly Met Thr Thr Gly Asn Asn Pro Met Ser Gly 

530 " 535 540 

Thr Thr Thr Ser Thr Asn Thr Phe Pro Gly Gly Pro He Ala Thr Leu 
545 550 555 560 

Phe Asn Met Ser Met Ser He Lys Asp Arg His Glu Ser Val Gly His 

565 570 575 

Gly Glu Asp Phe Ser Lys Val Ser Gin Asn Pro He Leu Thr Ser Leu 

580 585 590 

Leu Gin He Thr Gly Asn Gly Gly Ser Thr He Gly Ser Ser Pro Thr 

595 " 600 605 

Pro Pro His His Thr Pro Pro Pro Val Ser Ser Met Ala Gly Asn Thr 

610 615 620 

Lys Asn His Pro Met Leu Met Asn Leu Leu Lys Asp Asn Pro Ala Gin 
625 630 635 640 

Asp Phe Ser Thr Leu Tyr Gly Ser Ser Pro Leu Glu Arg Gin Asn Ser 
645 650 655 
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Ser ser Gly Ser Pro Arg Met Glu lie Cys Ser Gly Ser Asn Lys Thr 

660 665 ' 670 

Lys Lys Lys Lys Ser Ser Arg Leu Pro Pro Glu Lys Pro Lys His Gin 

675 680 685 

Thr Glu Asp Asp Phe Gin Arg Glu Leu Phe Ser Met Asp Val Asp Ser 

690 695 700 

Gin Asn Pro lie Phe Asp Val Asn Met Thr Ala Asp Thr Leu Asn Thr 
705 . 710 715 720 

Pro His He Thr Pro Ala Pro Ser Gin Cys Ser Thr Pro Pro Thr Thr 

725 730 735 

Tyr Pro Gin Pro Val Pro His Pro Gin Pro Ser He Gin Arg Met Val 

740 745 750 

Arg Leu Ser Ser Ser Asp Ser He Gly Pro Asp Val Thr Asp He Leu 

755 760 765 

Ser Asp He Ala Glu Glu Ala Ser Lys Leu Pro Ser Thr Ser Asp Asp 

770 775 780 

Cys Pro Ala He Gly Thr Pro Leu Arg Asp ser Ser Ser Ser Gly His 
785 , 790 795 800 

Ser Gin Ser Thr Leu Phe Asp Ser Asp Val Phe Gin Thr Asn Asn Asn 

805 810 815 

Glu Asn Pro Tyr Thr Asp Pro Ala Asp Leu He Ala Asp Ala Ala Glv 

820 825 830 

Ser Pro Ser Ser Asp Ser Pro Thr Asn His Phe Phe His Asp Gly Val 

835 840 845 

Asp Phe Asn Pro Asp Leu Leu Asn Ser Gin Ser Gin Ser Gly Phe Glv 

850 855 860 

Glu Glu Tyr Phe Asp Glu Ser Ser Gin Ser Gly Asp Asn Asp Asp Phe 
865 870 875 880 

Lys Gly Phe Ala Ser Gin Ala Leu Asn Thr Leu Gly Val Pro Met Leu 

885 890 895 

Gly Gly Asp Asn Gly Glu Thr Lys Phe Lys Gly Asn Asn Gin Ala Asp 

900 905 9io 

Thr Val Asp Phe Ser He He Ser Val Ala Gly Lys Ala Leu Ala Pro 

915 920 925 

Ala Asp Leu Met Glu His His Ser Gly Ser Gin Gly Pro Leu Leu Thr 

930 935 940 

Thr Gly Asp Leu Gly Lys Glu Lys Thr Gin Lys Arg Val Lys Glu Gly 
5 950 955 96 q 

Asn Gly Thr Ser Asn Ser Thr Leu Ser Gly Pro Gly Leu Asp Ser Lys 

965 970 975 

Pro Gly Lys Arg Ser Arg Thr Pro Ser Asn Asp Gly Lys Ser Lys Asp 

980 985 990 

Lys Pro Pro Lys Arg Lys Lys Ala Asp Thr Glu Gly Lys Ser Pro Ser 

995 1000 1005 

His Ser Ser Ser Asn Arg Pro Phe Thr Pro Pro Thr Ser Thr Glv 

1010 1015 1020 

Gly Ser Lys Ser Pro Gly Ser Ala Gly Arg Ser Gin Thr Pro Pro 

1025 1030 1035 

Gly Val Ala Thr Pro Pro He Pro Lys He Thr He Gin He Pro 

1040 1045 1050 

Lys Gly Thr Val Met Val Gly Lys Pro Ser Ser His Ser Gin Tyr 

1055 1060 1065 

Thr Ser Ser Gly Ser Val Ser Ser Ser Gly Ser Lys Ser His His 

1?70 1075 ~ 1080 

Ser His Ser Ser Ser Ser Ser Ser Ser Ala Ser Thr Ser Gly Lvs 

1085 1090 1095 

Met Lys Ser Ser Lys Ser Glu Gly Ser Ser Ser Ser Lys Leu Ser 

1100 H05 mo 

Ser Ser Met Tyr Ser Ser Gin Gly Ser Ser Gly Ser Ser Gin Ser 

1115 1120 H25 

Lys Asn Ser Ser Gin Ser Gly Gly Lys Pro Gly Ser Ser Pro He 

1130 1135 ii 4 o 

Thr Lys His Gly Leu Ser Ser Gly Ser Ser Ser Thr Lys Met Lvs 

1145 1150 ii 55 * Y 

Pro Gin Gly Lys Pro Ser Ser Leu Met Asn Pro Ser Leu Ser Lvs 
1160 116S 1170 ** yB 
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Pro Asn lie Ser Pro Ser His Ser Arg Pro Pro Gly Gly Ser Asp 

1X75 1180 HQS 

Lys Leu Ala Ser Pro Met Lys Pro Val Pro Gly Thr Pro Pro Ser 

1190 1195 1200 

Ser Lys Ala Lys Ser Pro lie Ser Ser Gly Ser Gly Gly Ser His 

1205 1210 1215 

Met Ser Gly Thr Ser Ser Ser Ser Gly Met Lys Ser Ser Ser Gly 

1220 " 1225 1230 

Leu Gly Ser Ser Gly Ser Leu Ser Gin Lys Thr Pro Pro Ser Ser 

1235 * 1240 1245 

Asn Ser Cys Thr Ala Ser Ser Ser Ser Phe Ser Ser Ser Gly Ser 

1250 1255 1260 

Ser Met Ser Ser Ser Gin Asn Gin His Gly Ser Ser Lys Gly Lys 

1265 1270 1275 

Ser Pro Ser Arg Asn Lys Lys Pro Ser Leu Thr Ala Val He Asp 

1280 " 1285 1290 

Lys Leu Lys His Gly Val Val Thr Ser Gly Pro Gly Gly Glu Asp 

1295 * 1300 1305 

Pro Leu Asp Gly Gin Met Gly Val Ser Thr Asn Ser Ser Ser His 

1310 °* 1315 1320 

Pro Met Ser Ser Lys His Asn Met Ser Gly Gly Glu Phe Gin Gly 

1325 1330 1335 

Lys Arg Glu Lys Ser Asp Lys Asp Lys Ser Lys Val Ser Thr Ser 

1340 1345 1350 

Gly Ser Ser Val Asp Ser Ser Lys Lys Thr Ser Glu Ser Lys Asn 

1355 1360 1365 

Val Gly Ser Thr Gly Val Ala Lys He He He Ser Lys His Asp 

1370 1375 1380 

Gly Gly Ser Pro Ser He Lys Ala Lys Val Thr Leu Gin Lys Pro 

1385 1390 1395 

Gly Glu Ser Ser Gly Glu Gly Leu Arg Pro Gin Met Ala Ser Ser 

1400 1405 1410 

Lys Asn Tyr Gly Ser Pro Leu He Ser Gly Ser Thr Pro Lys His 

1415 * 1420 1425 

Glu Arg Gly Ser Pro Ser His Ser Lys Ser Pro Ala Tyr Thr Pro 

1430 " 1435 1440 

Gin Asn Leu Asp Ser Glu Ser Glu Ser Gly Ser Ser He Ala Glu 

1445 1450 1455 

Lys Ser Tyr Gin Asn Ser Pro Ser Ser Asp Asp Gly He Arg Pro 

1460 1465 1470 

Leu Pro Glu Tyr Ser Thr Glu Lys His Lys Lys His Lys Lys Glu 

1475 " 1480 1485 

Lys Lys Lys Val Lys Asp Lys Asp Arg Asp Arg Asp Arg Asp Lys 

1490 1495 1500 

Asp Arg Asp Lys Lys Lys Ser His Ser He Lys Pro Glu Ser Trp 

1505 1510 1515 

Ser Lys Ser Pro He Ser Ser Asp Gin Ser Leu Ser Met Thr Ser 

1520 1525 1530 

Asn Thr He Leu Ser Ala Asp Arg Pro Ser Arg Leu Ser Pro Asp 

1535 1540 1545 

Phe Met He Gly Glu Glu Asp Asp Asp Leu Met Asp Val Ala Leu 

1550 1555 1560 

He Gly Asn 

1565 
<210> 31 

<211> 1490 

<212> PRT 
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<400> 31 



Met 


Pro 


Asn 


Ser 


Glu 


Arg 


His 


Gly 


Gly 


Lys 


Lys 


Asp 


Gly 


Ser 


Gly 


Gly 


1 








5 










10 










15 


Ala 


Ser 


Gly 


Thr 


Leu 


Gin 


Pro 


Ser 


Ser 


Gly 


Gly 


Gly 


Ser 


Ser 


Asn 


Ser 








20 










25 










30 






Arg 


Glu 


Arg 


His 


Arg 


Leu 


Val 


Ser 


Lys 


His 


Lys 


Arg 


His 


Lys 


Ser 


Lys 






35 










40 










45 








His 


Ser 


Lys 


Asp 


Met 


Gly 


Leu 


Val 


Thr 


Pro 


Glu 


Ala 


Ala 


Ser 


Leu 


Gly 




50 










55 










60 








Thr 


Val 


lie 


Lys 


Pro 


Leu 


val 


Glu 


Tyr 


Asp 


Asp 


He 


Ser 


Ser 


Asp 


Ser 


65 










70 










75 








80 


Asp 


Thr 


Phe 


Ser 


Asp 


Asp 


Met 


Ala 


Phe 


Lys 


Leu 


Asp 


Arg 


Arg 


Glu 


Asn 










85 










90 










95 




Asp 


Glu 


Arg 


Arg 


Gly 


Ser 


Asp 


Arg 


Ser 


Asp 


Arg 


Leu 


His 


Lys 


His 


Arg 








100 










105 










110 






His 


His 


Gin 


His 


Arg 


Arg 


Ser 


Arg 


Asp 


Leu 


Leu 


Lys 


Ala 


Lys 


Gin 


Thr 






115 










120 










125 








Glu 


Lys 


Glu 


Lys 


Ser 


Gin 


Glu 


Val 


Ser 


Ser 


Lys 


Ser 


Gly 


Ser 


Met 


Lys 




130 










135 










140 








Asp 


Arg 


lie 


Ser 


Gly 


Ser 


Ser 


Lys 


Arg 


Ser 


Asn 


Glu 


Glu 


Thr 


Asp 


Asp 


145 










150 










155 




* 




160 


Tyr 


Gly 


Lys 


Ala 


Gin 


Val 


Ala 


Lys 


Ser 


Ser 


Ser 


Lys 


Glu 


Ser 


Arg 


Ser 










165 










170 










175 




Ser 


Lys 


Leu 


His 


Lys 


Glu 


Lys 


Thr 


Arg 


Lys 


Glu 


Arg 


Glu 


Leu 


Lys 


Ser 








180 










185 










190 




Gly 


His 


Lys 


Asp 


Arg 


Ser 


Lys 


Ser 


His 


Arg 


Lys 


Arg 


Glu 


Thr 


Pro 


Lys 






195 










200 










205 






Ser 


Tyr 


Lys 


Thr 


Val 


Asp 


Ser 


Pro 


Lys 


Arg 


Arg 


Ser 


Arg 


Ser 


Pro 


His 




210 










215 










220 










Arg 


Lys 


Trp 


Ser 


Asp 


Ser 


Ser 


Lys 


Gin 


Asp 


Asp 


Ser 


Pro 


Ser 


Gly 


Ala 


225 










230 










235 








240 


Ser 


Tyr 


Gly 


Gin 


Asp 


Tyr 


Asp 


Leu 


Ser 


Pro 


Ser 


Arg 


Ser 


His 


Thr 


Ser 










245 










250 










255 




Ser 


As n 


Tyr 


Asp 


Ser 


Tyr 


Lys 


Lys 


Ser 


Pro 


Gly 


Ser 


Thr 


Ser 


Arg 


Arg 








260 










265 










270 




Gin 


Ser 


Val 


Ser 


Pro 


Pro 


Tyr 


Lys 


Glu 


Pro 


Ser 


Ala 


Tyr 


Gin 


Ser 


Ser 






275 










280 










285 








Thr 


Arg 


Ser 


Pro 


Ser 


Pro 


Tyr 


Ser 


Arg 


Arg 


Gin 


Arg 


Ser 


Val 


Ser 


Pro 




290 










295 










300 










Tyr 


Ser 


Arg 


Arg 


Arg 


Ser 


Ser 


Ser 


Tyr 


Glu 


Arg 


Ser 


Gly 


Ser 


Tyr 


Ser 


305 










310 










315 






320 


Gly 


Arg 


Ser 


Pro 


Ser 


Pro 


Tyr 


Gly 


Arg 


Arg 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 










325 










330 










335 




Leu 


Ser 


Lys 


Arg 


Ser 


Leu 


Ser 


Arg 


Ser 


Pro 


Leu 


Pro 


Ser 


Arg 


Lys 


Ser 








340 










345 










350 




Met 


Lys 


Ser 


Arg 


Ser 


Arg 


Ser 


Pro 


Ala 


Tyr 


Ser 


Arg 


His 


Ser 


Ser 


Ser 






355 










360 










365 








His 


Ser 


Lys 


Lys 


Lys 


Arg 


Ser 


Ser 


Ser 


Arg 


Ser 


Arg 


His 


Ser 


Ser 


He 




370 










375 










380 










Ser 


Pro 


Val 


Arg 


Leu 


Pro 


Leu 


Asn 


Ser 


Ser 


Leu 


Gly 


Ala 


Glu 


Leu 


Ser 


385 










390 










395 








400 


Arg 


Lys 


Lys 


Lys 


Glu 


Arg 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Lys 


Met 


Asp 










405 










410 










415 


Gly 


Lys 


Glu 


Ser 


Lys 


Gly 


Ser 


Pro 


Val 


Phe 


Leu 


Pro 


Arg 


Lys 


Glu 


Asn 








420 










425 










430 






Ser 


Ser 


Val 


Glu 


Ala 


Lys 


Asp 


Ser 


Gly 


Leu 


Glu 


Ser 


Lys 


Lys 


Leu 


Pro 






435 










440 










445 






Arg 


Ser 


Val 


Lys 


Leu 


Glu 


Lys 


Ser 


Ala 


Pro 


Asp 


Thr 


Glu 


Leu 


Val 


Asn 




450 










455 








460 










Val 


Thr 


His 


Leu 


Asn 


Thr 


Glu 


Val 


Lys 


Asn 


Ser 


Ser 


Asp 


Thr 


Gly 


Lys 


465 










470 










475 






480 


Val 


Lys 


Leu 


Asp 


Glu 


Asn 


Ser 


Glu 


Lys 


His 


Leu 


Val 


Lys 


Asp 


Leu 


Lys 










485 










490 








495 


Ala 


Gin 


Gly 


Thr 


Arg 


Asp 


Ser 


Lys 


Pro 


lie 


Ala 


Leu 


Lys 


Glu 


Glu 


He 








500 










505 








510 
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Val 


Thr 


Pro 


Lys 


Glu 


Thr 


Glu 


Thr 


Ser 


Glu 


Lys 


Glu 


Thr 


Pro 


Pro 


Pro 






515 










520 








525 








Leu 


Pro 


Thr 


He 


Ala 


Ser 


Pro 


Pro 


Pro 


Pro 


Leu 


Pro 


Thr 


Thr 


Thr 


Pro 




530 










535 










540 










Pro 


Pro 


Gin 


Thr 


Pro 


Pro 


Leu 


Pro 


Pro 


Leu 


Pro 


Pro 


He 


Pro 


Ala 


Leu 


545 










550 










555 










560 


Pro 


Gin 


Gin 


Pro 


Pro 


Leu 


Pro 


Pro 


Ser 


Gin 


Pro 


Ala 


Phe 


Ser 


Gin 


Val 










565 










570 










575 




Pro 


Ala 


Ser 


Ser 


Thr 


Ser 


Thr 


Leu 


Pro 


Pro 


Ser 


Thr 


His 


Ser 


Lys 


Thr 








580 










585 










590 




Ser 


Ala 


Val 


Ser 


Ser 


Gin 


Ala 


Asn 


Ser 


Gin 


Pro 


Pro 


Val 


Gin 


Val 


Ser 






595 










600 










605 








Val 


Lys 


Thr 


Gin 


Val 


Ser 


Val 


Thr 


Ala 


Ala 


He 


Pro 


His 


Leu 


Lys 

j mm 


Thr 




610 










615 










620 








Ser 


Thr 


Leu 


Pro 


Pro 


Leu 


Pro 


Leu 


Pro 


Pro 


Leu 


Leu 


Pro 


Gly 


Gly 


Asp 


625 










630 










635 








640 


Asp 


Met 


Asp 


Ser 


Pro 


Lys 


Glu 


Thr 


Leu 


Pro 


Ser 


Lys 


Pro 


Val 


Lys 


Lys 










645 










650 










655 




Glu 


Lys 


Glu 


Gin 


Arg 


Thr 


Arg 


His 


Leu 


Leu 


Thr 


Asp 


Leu 


Pro 


Leu 


Pro 








660 










665 










670 






Pro 


Glu 


Leu 


Pro 


Gly 


Gly 


Asp 


Leu 


Ser 


Pro 


Pro 


Asp 


Ser 


Pro 


Glu 


Pro 






675 










680 










685 








Lys 


Ala 


He 


Thr 


Pro 


Pro 


Gin 


Gin 


Pro 


Tyr 


Lys 


Lys 


Arg 


Pro 


Lys 


He 




690 










695 










700 










Cys 


Cys 


Pro 


Arg 


Tyr 


Gly 


Glu 


Arg 


Arc? 


Gin 


Thr 


Glu 


Ser 


Asp 


Trp 


Gly 


705 










710 










715 










720 


Lys 


Arg 


Cys 


Val 


Asp 


Lys 


Phe 


Asp 


He 


He 


Gly 


He 


He 


Gly 


Glu 


Glv 










725 










730 










735 




Thr 


Tyr 


Gly 


Gin 


Val 


Tyr 


Lys 


Ala 


Arq 


Asp 


Lys 


Asp 


Thr 


Gly 


Glu 


Leu 








740 










745 










750 






Val 


Ala 


Leu 


Lys 


Lvs 


Val 


Arq 


Leu 


ASD 


Asn 


Glu 


Lvs 


Glu 


Glv 


Phe 


Pro 






755 










760 










765 








lie 


Thr 


Ala 


lie 


Arg 


Glu 


He 


Lvs 


He 


Leu 


Arq 


Gin 


Leu 


He 


His 






770 










775 










780 








Ser 


Val 


Val 


Asn 


Met 


Lys 


Glu 


He 


Val 


Thr 


Asp 


Lys 


Gin 


Asp 


Ala 


Leu 


785 










790 










795 










800 


Asp 


Phe 


Lys 


Lys 


Asp 


Lys 


Gly 


Ala 


Phe 


Tyr 


Leu 


Val 


Phe 


Glu 


Tvr 


Met 










805 










810 










815 




Asp 


His 


Asp 


Leu 


Met 


Gly 


Leu 


Leu 


Glu 


Ser 


Gly 


Leu 


Val 


His 


Phe 


Ser 








820 










825 










830 






Glu 


Asp 


His 


He 


Lys 


Ser 


Phe 


Met 


Lvs 


Gin 


Leu 


Met 


Glu 


Glv 


Leu 


Glu 






835 










840 










845 








Tyr 


Cys 


His 


Lys 


Lys 


Asn 


Phe 


Leu 


His 


Arq 


Asp 


He 


Lvs 


Cvs 


Ser 


Asn 




850 










855 










860 










He 


Leu 


Leu 


Asn 


Asn 


Ser 


Gly 


Gin 


He 


Lys 


Leu 


Ala 


Asp 


Phe 


Gly 


Leu 


865 










870 










875 










880 


Ala 


Arg 


Leu 


Tyr 


Asn 


Ser 


Glu 


Glu 


Ser 


Arg 


Pro 


Tyr 


Thr 


Asn 


Lys 


Val 










885 










890 










895 




He 


Thr 


Leu 


Trp 


Tyr 


Arg 


Pro 


Pro 


Glu 


Leu 


Leu 


Leu 


Gly 


Glu 


Glu 


Arg 








900 










905 










910 




Tyr 


Thr 


Pro 


Ala 


He 


Asp 


Val 


Trp 


Ser 


Cys 


Gly 


Cys 


He 


Leu 


Gly 


Glu 






915 










920 










925 








Leu 


Phe 


Thr 


Lys 


Lys 


Pro 


He 


Phe 


Gin 


Ala 


Asn 


Leu 


Glu 


Leu 


Ala 


Gin 




930 










935 










940 










Leu 


Glu 


Leu 


He 


Ser 


Arg 


Leu 


Cys 


Gly 


Ser 


Pro 


Cys 


Pro 


Ala 


Val 


Trp 


945 










950 










955 










960 


Pro 


Asp 


Val 


He 


Lys 


Leu 


Pro 


Tyr 


Phe 


Asn 


Thr 


Met 


Lys 


Pro 


Lys 


Lys 










965 










970 










975 




Gin 


Tyr 


Arg 


Arg 


Arg 


Leu 


Arg 


Glu 


Glu 


Phe 


Ser 


Phe 


He 


Pro 


Ser 


Ala 








980 










985 










990 






Ala 


Leu 


Asp 


Leu 


Leu 


Asp 


His 


Met 


Leu Thr Leu Asp Pro Ser Lys Arg 



995 1000 1005 

Cys Thr Ala Glu Gin Thr Leu Gin Ser Asp Phe Leu Lys Asp Val 

1010 1015 1020 

Glu Leu Ser Lys Met Ala Pro Pro Asp Leu Pro His Trp Gin Asp 

1025 1030 1035 
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Cys His 


Glu 


Leu Trp Ser 


Lys 


1040 






1045 


Gly Val 


Val 


Val Glu Glu 


Pro 


1055 






1060 


Glu Thr 


Thr 


Ser Gly Thr 


Ser 


1070 






1075 


Pro Ala 


Pro 


Pro Gin Pro 


Ala 


1085 






1090 


Gly Asp 


Ala 


He Gly Leu Ala 


1100 






1105 


Ser Glu 


Leu 


Ala Val Leu 


Leu 


1115 






1120 


Leu Ser 


He 


Pro Gin Met 


Ala 


1130 






1135 


Pro Glu 


Met 


Gin Gin Gin 


Leu 


1145 






1150 


Ala Leu 


Thr 


Glu Ala Thr 


Ser 


1160 






1165 


Ala Pro 


Glu 


Glu Ser Leu 


Lys 


1175 






1180 


Leu Pro 


Ser 


Ala Glu Gin 


Met 


1190 






1195 


Ala Asp 


Met 


Gin Asn He 


Leu 


1205 






1210 


Lys Thr 


Gin 


Glu Pro Ala 


Gly 


1220 






1225 


Lys Asn 


Ser Gly Pro Gin Gly 


1235 






1240 


Gin Glu 


Glu 


Ala Ala Ala 


Cys 


1250 






1255 


Lys Arg 


Pro 


Pro Glu Pro 


Pro 


1265 






1270 


Pro Pro 


Leu 


Val Glu Gly Asp 


1280 






1285 


Asn Pro 


Ala 


Val Thr Ala 


Ala 


1295 






1300 


Glu Ala 


Glu 


Pro Pro Gly His 


1310 






1315 


Arg Pro 


Met 


Glu Tyr Ser 


Thr 


1325 






1330 


Gly Asn 


Thr 


Asp Gly Pro 


Glu 


1340 






1345 


Asp Glu 


Arg 


Asn Ser Gly 


Pro 


1355 




1360 


Thr Leu 


Val 


Lys Asn Arg 


Thr 


1370 




1375 


Gly Glu 


Ser 


Ser Ser Tyr 


Gin 


1385 




1390 


Gly Asp 


Gin Asp Leu Arg 


Phe 


1400 






1405 


Pro Val 


Val 


Gly Gin Pro 


Phe 


. 1415 






1420 


Ser Val 


Val 


His Ala Glu 


Thr 


1430 






1435 


Gly Pro 


Gly Thr Thr Gly Ala 


1445 






1450 


Trp Gly 


Gly Pro Thr Gin Ser 


1460 






1465 


Gly Pro 


Thr Arg Val Pro 


Pro 


1475 






1480 


Pro Tyr 








1490 
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Lys Arg 


Arg 


Arg Gin 


Arcr 


VJJ.J.J. CCXi 






1050 






Pro Pro 


Ser 


Lys Thr 


Ser 


Arg Lys 






1065 




Thr Glu 


Pro 


Val Lys 


Asn 


Ser Ser 






1080 






Pro Gly Lys 


Val Glu 


Ser 


Gly Ala 






1095 




Asp lie 


Thr 


Gin Gin 


Leu 


Asn Gin 






1110 






Asn Leu 


Leu 


Gin Ser 


Gin Thr Asp 






1125 






Gin Leu 


Leu 


Asn He 


His 


Ser Asn 






1140 






Glu Ala 


Leu 


Asn Gin 


Ser 


He Ser 






1155 






Gin Gin Gin Asp Ser 


Glu 


Thr Met 






1170 






Glu Ala 


Pro 


Ser Ala 


Pro 


Val He 






1185 






Thr Leu 


Glu 


Ala Ser 


Ser 


Thr Pro 






1200 






Ala Val 


Leu 


Leu Ser 


Gin 


Leu Met 






1215 






Ser Leu 


Glu 


Glu Asn 


Asn 


Ser Asp 

- 






1230 




Pro Arg 


Arg 


Thr Pro 


Thr 


Met Pro 






1245 






Pro Pro 


His 


He Leu 


Pro 


Pro Glu 






1260 






Gly Pro 


Pro 


Pro Pro 


Pro 


Pro Pro 






1275 






Leu Ser 


Ser 


Ala Pro 


Gin 


Glu Leu 






1290 






Leu Leu 


Gin 


Leu Leu 


Ser 


Gin Pro 






1305 






Leu Pro 


His 


Glu His 


Gin 


Ala Leu 






1320 






Arg Pro 


Arg 


Pro Asn 


Arg 


Thr Tyr 






1335 




Thr Gly 


Phe 


Ser Ala 


He 


Asp Thr 






1350 




Ala Leu 


Thr 


Glu Ser 


Leu 


Val Gin 






1365 






Phe Ser 


Gly Ser Leu 


Ser 


His Leu 






1380 






Gly Thr Gly Ser Val 


Gin 


Phe Pro 






1395 






Ala Arg Val 


Pro Leu 


Ala 


Leu His 






1410 






Leu Lys 


Ala 


Glu Gly 


Ser 


Ser Asn 






. 1425 






Lys Leu 


Gin 


Asn Tyr 


Gly Glu Leu 






1440 






Ser Ser 


Ser Gly Ala 


Gly Leu His 






1455 






Ser Ala 


Tyr Gly Lys 


Leu 


Tyr Arg 






1470 






Arg Gly Gly Arg Gly 


Arg Gly Val 






1485 







<211> 381 
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<212> PRT 

<213> Homo sapiens 



<400> 32 

Met Leu Thr Arg Leu Phe Ser Glu Pro Gly Leu Leu Ser Asp Val Pro 

15 10 15 

Lys Phe Ala Ser Trp Gly Asp Gly Glu Asp Asp Glu Pro Arg Ser Asp 

20 25 30 

Lys Gly Asp Ala Pro Pro Pro Pro Pro Pro Ala Pro Gly Pro Gly Ala 

35 40 45 

Pro Gly Pro Ala Arg Ala Ala Lys Pro Val Pro Leu Arg Gly Glu Glu 

50 55 60 

Gly Thr Glu Ala Thr Leu Ala Glu Val Lys Glu Glu Gly Glu Leu Gly 
65 70 75 80 

Gly Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Leu Asp Glu Ala 

85 90 95 

Glu Gly Glu Arg Pro Lys Lys Arg Gly Pro Lys Lys Arg Lys Met Thr 

100 105 110 

Lys Ala Arg Leu Glu Arg Ser Lys Leu Arg Arg Gin Lys Ala Asn Ala 

115 120 125 

Arg Glu Arg Asn Arg Met His Asp Leu Asn Ala Ala Leu Asp Asn Leu 

130 135 140 

Arg Lys Val Val Pro Cys Tyr Ser Lys Thr Gin Lys Leu Ser Lys lie 
145 150 155 160 

Glu Thr Leu Arg Leu Ala Lys Asn Tyr lie Trp Ala Leu Ser Glu lie 

165 170 175 

Leu Arg Ser Gly Lys Arg Pro Asp Leu Val Ser Tyr Val Gin Thr Leu 

180 185 190 

Cys Lys Gly Leu Ser Gin Pro Thr Thr Asn Leu Val Ala Gly Cys Leu 

195 200 205 

Gin Leu Asn Ser Arg Asn Phe Leu Thr Glu Gin Gly Ala Asp Gly Ala 

210 ~ 215 220 

Gly Arg Phe His Gly Ser Gly Gly Pro Phe Ala Met His Pro Tyr Pro 
225 ~ 230 235 240 

Tyr Pro Cys Ser Arg Leu Ala Gly Ala Gin Cys Gin Ala Ala Gly Gly 

245 250 255 

Leu Gly Gly Gly Ala Ala His Ala Leu Arg Thr His Gly Tyr Cys Ala 

260 265 270 

Ala Tyr Glu Thr Leu Tyr Ala Ala Ala Gly Gly Gly Gly Ala Ser Pro 

275 280 285 

Asp Tyr Asn Ser Ser Glu Tyr Glu Gly Pro Leu Ser Pro Pro Leu Cys 

290 295 300 

Leu Asn Gly Asn Phe Ser Leu Lys Gin Asp Ser Ser Pro Asp His Glu 
305 310 315 320 

Lvs Ser Tvx His Tvr Ser Met His Tyr Ser Ala Leu Pro Gly Ser Arg 

325 330 335 

His Gly His Gly Leu Val Phe Gly Ser Ser Ala Val Arg Gly Gly Val 

340 345 350 

His Ser Glu Asn Leu Leu Ser Tyr Asp Met His Leu His His Asp Arg 

355 360 " 365 

Gly Pro Met Tyr Glu Glu Leu Asn Ala Phe Phe His Asn 

370 ' 375 380 

<210> 33 

<211> 445 

<212> PRT 



<213> Homo sapiens 
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<400> 33 

Met Ser Lys Leu Pro Arg Glu Leu Thr Arg Asp Leu Glu Arg Ser Leu 

1 5 io * 15 

Pro Ala Val Ala Ser Leu Gly Ser Ser Leu Ser His Ser Gin Ser Leu 

20 25 30 

Ser Ser His Leu Leu Pro Pro Pro Glu Lys Arg Arg Ala lie Ser Asp 

35 40 45 

Val Arg Arg Thr Phe Cys Leu Phe Val Thr Phe Asp Leu Leu Phe lie 

50 55 60 

Ser Leu Leu Trp lie He Glu Leu Asn Thr Asn Thr Gly He Arc* Lvs 
65 70 75 80 

Asn Leu Glu Gin Glu He He Gin Tyr Asn Phe Lys Thr Ser Phe Phe 

85 90 95 

Asp He Phe Val Leu Ala Phe Phe Arg Phe Ser Gly Leu Leu Leu Gly 

100 105 * no 

Tyr Ala Val Leu Gin Leu Arg His Trp Trp Val He Ala Val Thr Thr 

115 120 125 

Leu Val Ser Ser Ala Phe Leu He Val Lys Val He Leu Ser Glu Leu 

130 135 140 

Leu Ser Lys Gly Ala Phe Gly Tyr Leu Leu Pro He Val Ser Phe Val 
145 150 155 160 

Leu Ala Trp Leu Glu Thr Trp Phe Leu Asp Phe Lys Val Leu Pro Gin 

165 170 * 175 

Glu Ala Glu Glu Glu Arg Trp Tyr Leu Ala Ala Gin Val Ala Val Ala 

180 185 190 

Arg Gly Pro Leu Leu Phe Ser Gly Ala Leu Ser Glu Gly Gin Phe Tyr 

195 200 205 

Ser Pro Pro Glu Ser Phe Ala Gly Ser Asp Asn Glu Ser Asp Glu Glu 

210 215 220 

Val Ala Gly Lys Lys Ser Phe Ser Ala Gin Glu Arg Glu Tyr He Arq 
22 5 230 235 ~ 240 

Gin Gly Lys Glu Ala Thr Ala Val Val Asp Gin He Leu Ala Gin Glu 

245 250 255 

Glu Asn Trp Lys Phe Glu Lys Asn Asn Glu Tyr Gly Asp Thr Val Tyr 

260 265 270 

Thr He Glu Val Pro Phe His Gly Lys Thr Phe lie Leu Lys Thr Phe 

275 280 285 

Leu Pro Cys Pro Ala Glu Leu Val Tyr Gin Glu Val He Leu Gin Pro 

290 295 300 

Glu Arg Met Val Leu Trp Asn Lys Thr Val Thr Ala Cys Gin He Leu 
305 310 315 320 

Gin Arg Val Glu Asp Asn Thr Leu He Ser Tyr Asp Val Ser Ala Gly 

325 330 & 335 

Ala Ala Gly Gly Val Val Ser Pro Arg Asp Phe Val Asn Val Arg Arg 

340 345 350 

He Glu Arg Arg Arg Asp Arg Tyr Leu Ser Ser Gly He Ala Thr Ser 

355 360 365 

His Ser Ala Lys Pro Pro Thr His Lys Tyr Val Arg Gly Glu Asn Glv 

370 375 380 

Pro Gly Gly Phe He Val Leu Lys Ser Ala Ser Asn Pro Arg Val Cys 
385 390 395 400 

Thr Phe Val Trp He Leu Asn Thr Asp Leu Lys Gly Arg Leu Pro Arg 

405 410 " ~ 415 

Tyr Leu He His Gin Ser Leu Ala Ala Thr Met Phe Glu Phe Ala Phe 

420 425 43 0 

His Leu Arg Gin Arg He Ser Glu Leu Gly Ala Arg Ala 
435 440 445 

<210> 34 

<211> 167 

<212> PRT 

<213> Homo sapiens 
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<400> 34 








Met Ala Thr 


Ser 


Glu Leu Ser 


Cys 


1 




5 




Arg Arg Glu 


Ala 


Phe Trp Ala 


Glu 




20 






Arg Pro Glu 


Glu 


Gly Cys Ser 


Leu 


35 






40 


Glu Thr Tyr His 


Gin Gin Gly Gin 


50 




55 




Pro Trp Leu 


Met 


Met Arg Met 


Gly 


65 




70 




Tyr Gin Leu 


Pro 


Tyr Gin Arg Val 




85 




Ala Lys Met Gly 


Ala Thr Lys 


Glu 




100 






Leu Gin Glu 


Leu 


Leu Ala Leu 


Glu 


115 






120 


Asp Arg Gin 


Glu 


Val Ala Glu 


lie 


130 




135 




Pro Val Ser 


Lys 


Pro Gly Ala 


Leu 


145 




150 




Ser Gin Glu 


Ala 


Gin Arg Gly 








165 




<210> 35 








<211> 282 








<212> PRT 








<213> Homo 


sapiens 





Glu Val Ser Glu Glu Asn Cys Glu 

10 15 
Trp Lys Asp Leu Thr Leu Ser Thr 
25 " 30 

His Glu Glu Asp Thr Gin Arg His 
45 

Cys Gin Val Leu Val Gin Arg Ser 
60 

lie Leu Gly Arg Gly Leu Gin Glu 

75 " 80 
Leu Pro Leu Pro lie Phe Thr Pro 

90 95 
Glu Arg Glu Asp Thr Pro He Gin 
105 - 110 
Thr Ala Leu Gly Gly Gin Cys Val 
125 

Thr Lys Gin Leu Pro Pro Val Val 
140 

Arg Arg Ser Leu Ser Arg Ser Met 
155 160 



<400> 35 

Met Ser Gly Ala Asp Arg Ser Pro Asn Ala Gly Ala Ala Pro Asp Ser 

1 ^ 5 ~ 10 15 

Ala Pro Gly Gin Ala Ala Val Ala Ser Ala Tyr Gin Arg Phe Glu Pro 

20 25 30 

Arg Ala Tyr Leu Arg Asn Asn Tyr Ala Pro Pro Arg Gly Asp Leu Cys 

35 40 45 

Asn Pro Asn Gly Val Gly Pro Trp Lys Leu Arg Cys Leu Ala Gin Thr 

50 55 60 

Phe Ala Thr Gly Glu Val Ser Gly Arg Thr Leu He Asp He Gly Ser 
65 70 75 80 

Gly Pro Thr Val Tyr Gin Leu Leu Ser Ala Cys Ser His Phe Glu Asp 

85 90 95 

He Thr Met Thr Asp Phe Leu Glu Val Asn Arg Gin Glu Leu Gly Arg 

100 105 HO 

Trp Leu Gin Glu Glu Pro Gly Ala Phe Asn Trp Ser Met Tyr Ser Gin 

115 120 125 

His Ala Cys Leu He Glu Gly Lys Gly Glu Cys Trp Gin Asp Lys Glu 

130 135 140 

Arg Gin Leu Arg Ala Arg Val Lys Arg Val Leu Pro He Asp Val His 
145 ~ 150 155 160 

Gin Pro Gin Pro Leu Gly Ala Gly Ser Pro Ala Pro Leu Pro Ala Asp 

165 170 175 

Ala Leu Val Ser Ala Phe Cys Leu Glu Ala Val Ser Pro Asp Leu Ala 

180 185 190 

Ser Phe Gin Arg Ala Leu Asp His He Thr Thr Leu Leu Arg Pro Gly 

195 200 205 

Gly His Leu Leu Leu He Gly Ala Leu Glu Glu Ser Trp Tyr Leu Ala 

210 215 220 

Gly Glu Ala Arg Leu Thr Val Val Pro Val Ser Glu Glu Glu Val Arg 
225 230 235 240 
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Glu Ala Leu Val Arg Ser Gly Tyr Lys Val Arg Asp Leu Arg Thr Tyr 

245 250 255 

He Met Pro Ala His Leu Gin Thr Gly Val Asp Asp Val Lys Gly Val 

260 265 270 

Phe Phe Ala Trp Ala Gin Lys Val Gly Leu 

275 280 
<210> 36 

<211> 1255 

<212> PRT 

<213> Homo sapiens 



<400> 36 

Met Glu Leu Ala Ala Leu Cys Arg Trp Gly Leu Leu Leu Ala Leu Leu 

1 5 io 15 

Pro Pro Gly Ala Ala Ser Thr Gin Val Cys Thr Gly Thr Asp Met Lys 

20 25 30 

Leu Arg Leu Pro Ala Ser Pro Glu Thr His Leu Asp Met Leu Arg His 

35 40 45 

Leu Tyr Gin Gly Cys Gin Val Val Gin Gly Asn Leu Glu Leu Thr Tyr 

50 55 60 

Leu Pro Thr Asn Ala Ser Leu Ser Phe Leu Gin Asp He Gin Glu Val 
65 7 ° 75 80 

Gin Gly Tyr Val Leu He Ala His Asn Gin Val Arg Gin Val Pro Leu 

85 90 95 

Gin Arg Leu Arg He Val Arg Gly Thr Gin Leu Phe Glu Asp Asn Tyr 

100 105 no 

Ala Leu Ala Val Leu Asp Asn Gly Asp Pro Leu Asn Asn Thr Thr Pro 

115 120 125 

Val Thr Gly Ala Ser Pro Gly Gly Leu Arg Glu Leu Gin Leu Arg Ser 

"0 135 140 

Leu Thr Glu He Leu Lys Gly Gly Val Leu He Gin Arg Asn Pro Gin 
145 150 155 ^ 160 

Leu Cys Tyr Gin Asp Thr He Leu Trp Lys Asp He Phe His Lys Asn 

165 170 175 

Asn Gin Leu Ala Leu Thr Leu He Asp Thr Asn Arg Ser Arg Ala Cys 

180 185 190 

His Pro Cys Ser Pro Met Cys Lys Gly Ser Arg Cys Trp Gly Glu Ser 

195 200 205 

Ser Glu Asp Cys Gin Ser Leu Thr Arg Thr Val Cys Ala Gly Glv Cvs 

210 215 220 

Ala Arg Cys Lys Gly Pro Leu Pro Thr Asp Cys Cys His Glu Gin Cvs 
225 230 235 240 

Ala Ala Gly Cys Thr Gly Pro Lys His Ser Asp Cys Leu Ala Cys Leu 

245 250 255 

His Phe Asn His Ser Gly He Cys Glu Leu His Cys Pro Ala Leu Val 

260 265 A 270 

Thr Tyr Asn Thr Asp Thr Phe Glu Ser Met Pro Asn Pro Glu Gly Arg 

275 280 285 

Tyr Thr Phe Gly Ala Ser Cys Val Thr Ala Cys Pro Tyr Asn Tyr Leu 

290 295 300 

Ser Thr Asp Val Gly Ser Cys Thr Leu Val Cys Pro Leu His Asn Gin 
3 ° 5 310 315 320 

Glu Val Thr Ala Glu Asp Gly Thr Gin Arg Cys Glu Lys Cys Ser Lvs 

325 330 335 

Pro Cys Ala Arg Val Cys Tyr Gly Leu Gly Met Glu His Leu Atq Glu 

340 345 3 5 o 

Val Arg Ala Val Thr Ser Ala Asn He Gin Glu Phe Ala Gly Cvs Lvs 

355 360 365 

Lys He Phe Gly Ser Leu Ala Phe Leu Pro Glu Ser Phe Asp Glv Asn 
370 375 3 8 o * P 
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Pro Ala Ser Asn Thr Ala Pro Leu Gin Pro Glu Gin Leu Gin Val Phe 
385 390 395 400 

Glu Thr Leu Glu Glu lie Thr Gly Tyr Leu Tyr lie Ser Ala Trp Pro 

405 410 415 

Asp Ser Leu Pro Asp Leu Ser Val Phe Gin Asn Leu Gin Val lie Arg 

420 425 430 

Gly Arg lie Leu His Asn Gly Ala Tyr Ser Leu Thr Leu Gin Gly Leu 

435 440 445 

Gly lie Ser Trp Leu Gly Leu Arg Ser Leu Arg Glu Leu Gly Ser Gly 

450 * 455 460 

Leu Ala Leu He His His Asn Thr His Leu Cys Phe Val His Thr Val 
465 470 475 480 

Pro Trp Asp Gin Leu Phe Arg Asn Pro His Gin Ala Leu Leu His Thr 

485 490 495 

Ala Asn Arg Pro Glu Asp Glu Cys Val Gly Glu Gly Leu Ala Cys His 

500 505 510 

Gin Leu Cys Ala Arg Gly His Cys Trp Gly Pro Gly Pro Thr Gin Cys 

515 520 525 

Val Asn Cys Ser Gin Phe Leu Arg Gly Gin Glu Cys Val Glu Glu Cys 

530 535 540 

Arg Val Leu Gin Gly Leu Pro Arg Glu Tyr Val Asn Ala Arg His Cys 
545 550 555 560 

Leu Pro Cys His Pro Glu Cys Gin Pro Gin Asn Gly Ser Val Thr Cys 

565 570 575 

Phe Gly Pro Glu Ala Asp Gin Cys Val Ala Cys Ala His Tyr Lys Asp 

580 585 590 

Pro Pro Phe Cys Val Ala Arg Cys Pro Ser Gly Val Lys Pro Asp Leu 

595 600 605 

Ser Tyr Met Pro He Trp Lys Phe Pro Asp Glu Glu Gly Ala Cys Gin 

610 615 620 

Pro Cys Pro He Asn Cys Thr His Ser Cys Val Asp Leu Asp Asp Lys 
625 630 635 640 

Gly Cys Pro Ala Glu Gin Arg Ala Ser Pro Leu Thr Ser He Val Ser 

645 650 655 

Ala Val Val Gly He Leu Leu Val Val Val Leu Gly Val Val Phe Gly 

660 665 670 

He Leu He Lys Arg Arg Gin Gin Lys He Arg Lys Tyr Thr Met Arg 

675 680 685 

Arg Leu Leu Gin Glu Thr Glu Leu Val Glu Pro Leu Thr Pro Ser Gly 

690 695 700 

Ala Met Pro Asn Gin Ala Gin Met Arg He Leu Lys Glu Thr Glu Leu 
705 710 715 720 

Arc? Lys Val Lys Val Leu Gly Ser Gly Ala Phe Gly Thr Val Tyr Lys 

725 730 735 

Gly He Trp He Pro Asp Gly Glu Asn Val Lys He Pro Val Ala He 

740 745 750 

Lys Val Leu Arg Glu Asn Thr Ser Pro Lys Ala Asn Lys Glu He Leu 

755 760 765 

Asp Glu Ala Tyr Val Met Ala Gly Val Gly Ser Pro Tyr Val Ser Arg 

770 775 780 

Leu Leu Gly He Cys Leu Thr Ser Thr Val Gin Leu Val Thr Gin Leu 
785 790 795 800 

Met Pro Tyr Gly Cys Leu Leu Asp His Val Arg Glu Asn Arg Gly Arg 

805 810 815 

Leu Gly Ser Gin Asp Leu Leu Asn Trp Cys Met Gin He Ala Lys Gly 

820 825 830 

Met Ser Tyr Leu Glu Asp Val Arg Leu Val His Arg Asp Leu Ala Ala 

835 840 845 

Arg Asn Val Leu Val Lys Ser Pro Asn His Val Lys He Thr Asp Phe 

850 855 860 

Gly Leu Ala Arg Leu Leu Asp He Asp Glu Thr Glu Tyr His Ala Asp 
865 " 870 875 880 

Gly Gly Lys Val Pro He Lys Trp Met Ala Leu Glu Ser He Leu Arg 

885 890 895 

Arg Arg Phe Thr His Gin Ser Asp Val Trp Ser Tyr Gly Val Thr Val 
900 905 910 
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Trp 


Glu 


Leu 


Met 


Thr 


Phe 


Gly 


Ala 


Lys 


Pro 


Tyr 


Asp Gly He 


Pro 


Ala 






915 










920 








925 






Arg 


Glu 


He 


Pro 


Asp 


Leu 


Leu 


Glu 


Lys 


Gly 


Glu 


Arg Leu Pro 


Gin 


Pro 




930 










935 










940 






Pro 


He 


Cys 


Thr 


He 


Asp 


Val 


Tyr 


Met 


He 


Met 


Val Lys Cys 


Trp 


Met 


945 










950 










955 


960 


He 


Asp 


Ser 


Glu 


Cys 


Arg 


Pro 


Arg 


Phe 


Arg 


Glu 


Leu Val Ser 


Glu 


Phe 










965 










970 






975 




Ser 


Arg 


Met 


Ala 


Arg 


Asp 


Pro 


Gin 


Arg 


Phe 


Val 


Val He Gin 


Asn 


Glu 








980 










985 






990 






Asp 


Leu 


Gly 


Pro 


Ala 


Ser 


Pro 


Leu 


Asp Ser Thr Phe Tyr Arg Ser Leu 






995 










1000 






1005 







Leu 


Glu 


Asp Asp 


Asp Met 


Gly 


7\ en 


i-lC UL 


V Q J. 


Asp 


Aid 


Glu 


r* 1 n 


Tyr 




1010 








1015 










"1 A O ft 






Leu 


Val 


Pro 


Gin 


Gin Gly Phe 


Phe 




Pro 




Drn 


Ala 


Pro 


Gly 




1025 








1030 










iUib 






Ala 


Gly 


Gly Met 


Val His 


His 




His 


-wig 


Car 
OCX. 


OCI 


Ser 


Thr 


Arg 




1040 








1045 










IAEA 






Ser 


Glv 


Gly Gly 


Asp Leu 


Thr 


liCU 


w-L jr 


Leu 


nl 11 


Pro 


Ser 


\jrJ.U 


Glu 




1055 








1060 










*1 A £ c 

1065 








Glu 


Ala 


Pro 


Arg 


Ser Pro 


Leu 


AT a 

xA. J. Ct 




OCX 


m ii 

uXU 


m i/ 


Ala 


f~% ~] -m m 

<j-i.y 


Ser 




1 070 








1075 










1080 






Asp 


Val 


Phe 


Asp 


Gly Asp 


Leu 


al -vr 

uiy 


net 




Hi =a 


ax a 


Lys 


Gly 


Leu 












1090 










1095 




Gin 


Ser 


Leu 


Pro 


Thr His 


Asp 


Pro 


Ser 


Pro 


Leu 


\j±n 


TV on 

Arg 


Tyr 


Ser 




1100 








1105 










1110 




Glu 


Asp 


Pro 


Thr 


Val Pro 


Leu 


Pro 


Ser 


Glu 


Thr 


Asp 


Gly 


Tyr 


Val 




1115 








1120 










1125 




Ala 


Pro 


Leu 


Thr 


Cys Ser 


Pro 


Gin 


Pro 


Glu 


Tyr 


Val 


Asn 


Gin 


Pro 




1130 








1135 








1140 








Asp 


Val 
1145 


Arg 


Pro 


Gin Pro 


Pro 
1150 


Ser 


Pro 


Arg 


Glu 


Gly 
1155 


Pro 


Leu 


Pro 


Ala 


Ala 


Arg 


Pro 


Ala Gly Ala 


Thr 


Leu 


Glu 


Arg 


Ala 


Lys 


Thr 


Leu 




1160 








1165 








1170 






Ser 


Pro 


Gly Lys 


Asn Gly Val 


Val 


Lys 


Asp 


Val 


Phe 


Ala 


Phe 


Gly 




1175 








1180 










1185 






Gly 


Ala 


Val 


Glu 


Asn Pro 


Glu 


Tyr 


Leu 


Thr 


Pro 


Gin 


Gly 


Gly 


Ala 




1190 








1195 










1200 




Ala 


Pro 


Gin 


Pro 


His Pro 


Pro 


Pro 


Ala 


Phe 


Ser 


Pro 


Ala 


Phe 


Asp 




1205 








1210 










1215 






Asn 


Leu 


Tyr 


Tyr 


Trp Asp 


Gin 


Asp 


Pro 


Pro 


Glu 


Arg 


Gly 


Ala 


Pro 




1220 








1225 










1230 






Pro 


Ser 


Thr 


Phe 


Lys Gly Thr 


Pro 


Thr 


Ala 


Glu. 


Asn 


Pro 


Glu 


Tyr 




1235 








1240 










1245 






Leu 


Gly 


Leu 


Asp 


Val Pro 


Val 



















1250 1255 
<210> 37 



<211> 532 
<212> PRT 
<213> Homo sapiens 



<400> 37 

Met Glu Leu Asp Leu Ser Pro Pro 
1 5 
Leu Trp Pro Ala Pro Gly Thr Pro 
20 

Thr Pro Leu Pro Glu Glu Val Lys 

35 40 
Thr Thr Gly Arg Lys Leu Arg Glu 
50 55 



His Leu Ser Ser Ser Pro Glu Asp 

10 15 
Pro Gly Thr Pro Arg Pro Pro Asp 
25 30 
Arg Ser Gin Pro Leu Leu He Pro 
45 

Glu Glu Arg Arg Ala Thr Ser Leu 
60 
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Pro Ser 


He 


Pro 


Asn 


Pro 


Phe 


Pro 


65 








70 






Ser Pro 


He 


Leu Gly 


Glv 


Pro 


Ser 








85 








Asp Ala 


Ser 


Arg 


XT L. \J 


His 


Val 


Val 






1UU 










Cvs A3TQ 


Ser 


VctJ. 




Val 


Ala 


Ala 


115 










120 


Glu Met 


Leu 


Val 


Gin 


Ara 


Ala 


His 


130 










135 




Leu Val 


Glu 


Cys 


His 


Pro 


His 


Leu 


145 








150 






His Glu 


Ser 


Val 


Val 


Glu 


Val 


Gin 








165 








Ser Arg 


Phe 


Val 


Phe 


Ara 


Lys 


Asn 






180 










Sex* Ser 


Pro 


His 


Ser 


Leu 


Phe 


Pro 
















7\ on 51 a 




Thr Gly 


XJ.C 


Cor 

OCX 


XX X O 


<u IV/ 










one 




Asn Ala. 


Glv 


Ser 


Phe 


Pro 


Glu 


He 


«fi A O 








£ J V 






S^t" Glv 




Lys 


Leu 


xx-p 


Lys 


At*ci 
.Mxy 








245 








Gly Leu 


xyx 


Tyr 


Ser 


Thr 


Lys 


Glv 






260 










wXXl xyx 


v d x 


Ala 


Asp 


Val 

V A X. 


Aan 


Glu 

VJX IX 




O *7 C 










O ft ft 


/til \ J* 7\ >~of 


Lys 


Leu 


Tyr 


vjiy 


Mpf* 
1*1 C L. 


Pro 
















xr X \J nUU 


T.'wd 
J_iy © 


Leu 


Arg 


A en 
noil 


ox y 


XXX £» 
















wlU nop 


f3l ii 


Gin 


Ser 


Arg 


X IIX 


v-y & 








325 








Lys Tyr 


Glv 

\jxy 


Val 


Gin 


Leu 


Tvr 

xyx. 


Lys 






340 










His Leu 


m o 


Pro 


Ser 


w s 


XJC IX 
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Ala 


Tro 


Leu 


610 






615 










620 










Val Ala His Val 


Arg Met Leu Gly Leu Asp Glu Arg 


Glu 


Lvs 


Ser 


Leu 


O a 9 




630 










0 -j 










640 


Gin Met lie Ara 


Gin Leu Ala Gly 


Pro 


Leu 


rue 


Ser 


Glu 


Asn 


Thr 


Leu 




645 




















655 




Gin. Phe Tyr Asn 


Glu Arg Val Val 


He 




Asn 


Ser 


He 


Leu 


Glu 


Arg 


660 










665 










670 






Met Cvs Ala Asd 


Val 


Leu 


Gin 


Gin 


Thr 


aj. a. 


inr 


Gin 


He 


Lys 


Phe 


Pro 


675 








680 










685 








Ser Thr Glv Val 


Asp 


Thr 


Met 


Pro 


Tyr 


Trp 


Asn 


Leu 


Leu 


Pro 


Pro 


Lys 


O ✓ v 






695 










700 










Ara Ptt> Tie* Lv*? 


Glu 


Val 


Leu Thr Asp 


He 


jpne 


Ala 


Lys 


Val 


Leu 


Glu 


f U3 




710 




















720 


xjjro vjxy up vox 


Asp 


Ser Arg Ser 


He 


His 


He 


Phe Asp Thr Leu Leu 




725 










730 










735 




m o net V7Xjr 


Val 


Tyr 


Trp 


Phe 


Cys 


Asn 


Asn 


Leu 


He 


Lys 


Glu 


Leu 


/ u 










745 










750 






XjCU Jjjro UxU XI1X 


Arg 


Lys 


Glu 


His 


Thr 


Leu 


Arg 


Ala 


Val 


Glu 


Leu 


Leu 


*7 








760 










765 








Tvr Ser Tie Phe 

-l.Jr.l- UCJ. J. -L. G c lib 


Cys 


Leu Asp 


Met 


Gin 


Gin 


Val 


Thr 


Leu 


Val 


Leu 


Leu 


770 






775 










780 










uiy nis jixc lieu 


Pro Gly Leu Leu Thr Asp 


Ser 


Ser 


Lys 


Trp 


His 


Ser 


r OS 




790 










795 










800 


jjc h ncL nojj riu 


Pro Gly Thr Ala 


Leu 


Ala 


Lys 


Leu 


Ala 


Val 


Trp 


Cys 




805 










810 










815 




Ala ucU ocl oci 


Tyr 


Ser 


Ser 


His 


Lys Gly Gin Ala Ser Thr Arg Gin 


son 










825 










830 






JjyS JUys AZvy JtlXS 


Arg Glu Asp 


lie Glu Asp Tyr 


He 


Ser 


Leu 


Phe 


Pro 


QIC 

odd 








840 










845 








JU6U nop Asjp Val 


Gin 


Pro 


Ser 


Lys 


Leu 


Met 


Arg 


Leu 


Leu 


Ser 


Ser 


Asn 


Ben 






855 










860 










f2l 11 21 en 7\ can 7A H a 
oxu nop nop /ixa 


Asn 


lie 


Leu 


Ser 


Ser 


Pro 


Thr 


Asp 


Arg 


Ser 


Met 


Ser 


DOS 




870 










875 










880 


Cav Qpv* T.Oll Car 
OCX. OCX UCLl OCX 


Ala 


Ser 


Gin 


Leu 


His 


Thr 


Val 


Asn 


Met 


Arg 


Asp 


Pro 




885 










890 










895 




UcU noli nl y Val 


Leu 


Ala 


Asn 


Leu 


Phe 


Leu 


Leu 


He 


Ser 


Ser 


He 


Leu 


onn 










905 










910 






wiy ocx Hiy xxxx 


Ala 


Gly 


Pro 


His 


Thr 


Gin 


Phe 


Val 


Gin 


Trp 


Phe 


Met 


^7 JLO 








920 










925 








Glu Glu Cys Val 


Asp 


Cys 


Leu 


Glu 


Gin Gly Gly Arg Gly Ser Val 


Leu 


930 






935 










940 










Gin Phe Met Pro 


Phe 


Thr 


Thr 


Val 


Ser 


Glu 


Leu 


Val 


Lys 


Val 


Ser 


Ala 


945 




950 










955 










960 


Met Ser Ser Pro 


Lys 


Val 


Val 


Leu 


Ala 


He Thr Asp 


Leu 


Ser 


Leu 


Pro 




965 










970 










975 




Leu Gly Arg Gin 


Val 


Ala 


Ala 


Lys 


Ala 


He 


Ala 


Ala 


Leu 








980 










985 
















<210> 41 


























<211> 490 


























<212> PRT 


























<213> Homo sapiens 

























<400> 41 

Met Glu Gin Lys Pro Ser Lys Val 
1 5 
Asn Ser Ala Arg Ser Pro Asp Gly 
20 

Cys Ser Leu Lys Thr Ser Met Ser 
35 40 



Glu Cys Gly Ser Asp Pro Glu Glu 

10 15 
Lys Arg Lys Arg Lys Asn Gly Gin 
25 30 
Gly Tyr He Pro Ser Tyr Leu Asp 
45 
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Lys Asp Glu Gin Cys Val Val Cys Gly Asp Lys Ala Thr Gly Tyr His 

50 55 60 

Tyr Arg Cys lie Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Thr 
65 70 75 80 

He Gin Lys Asn Leu His Pro Thr Tyr Ser Cys Lys Tyr Asp Ser Cys 

85 90 95 

Cys Val He Asp Lys He Thr Arg Asn Gin Cys Gin Leu Cys Arg Phe 

100 105 no 

Lys Lys Cys He Ala Val Gly Met Ala Met Asp Leu Val Leu Asp Asp 

H5 120 125 

Ser Lys Arg Val Ala Lys Arg Lys Leu He Glu Gin Asn Arg Glu Arq 

130 135 140 

Arg Arg Lys Glu Glu Met He Arg Ser Leu Gin Gin Arg Pro Glu Pro 
145 150 155 160 

Thr Pro Glu Glu Trp Asp Leu He His He Ala Thr Glu Ala His Arg 

165 170 175 

Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg Lys Phe Leu 

180 185 190 

Pro Asp Asp He Gly Gin Ser Pro He Val Ser Met Pro Asp Gly Asp 

195 200 205 

Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He He Thr Pro 

210 215 220 

Ala He Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met Phe Ser 
225 230 235 240 

Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly Cys Cys Met 

2 45 250 255 

Glu He Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro Glu Ser Asp 

260 265 270 

Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu Gin Leu Lys 

2 ?5 280 285 

Asn Gly Gly Leu Gly Val Val Ser Asp Ala He Phe Glu Leu Gly Lys 

290 295 300 

Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala Leu Leu Gin 
305 310 315 320 

Ala Val Leu Leu Met Ser Thr Asp Arg Ser Gly Leu Leu Cys Val Asp 

325 330 335 

Lys He Glu Lys Ser Gin Glu Ala Tyr Leu Leu Ala Phe Glu His Tyr 

340 345 350 

Val Asn His Arg Lys His Asn He Pro His Phe Trp Pro Lys Leu Leu 

355 360 365 

Met Lys Glu Arg Glu Val Gin Ser Ser He Leu Tyr Lys Gly Ala Ala 

370 375 380 

Ala Glu Gly Arg Pro Gly Gly Ser Leu Gly Val His Pro Glu Gly Gin 
385 390 395 400 

Gin Leu Leu Gly Met His Val Val Gin Gly Pro Gin Val Arg Gin Leu 

405 410 415 

Glu Gin Gin Leu Gly Glu Ala Gly Ser Leu Gin Gly Pro Val Leu Gin 

420 425 430 

His Gin Ser Pro Lys Ser Pro Gin Gin Arg Leu Leu Glu Leu Leu His 

435 440 445 

Arg Ser Gly He Leu His Ala Arg Ala Val Cys Gly Glu Asp Asp Ser 

450 455 460 

Ser Glu Ala Asp Ser Pro Ser Ser Ser Glu Glu Glu Pro Glu Val Cys 
465 470 475 480 

Glu Asp Leu Ala Gly Asn Ala Ala Ser Pro 
485 490 

<210> 42 

<211> 614 

<212> PRT 

<213> Homo sapiens 
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<400> 42 

Met Thr Thr Leu Asp Ser Asn Asn Asn Thr Gly Gly Val lie Thr Tyr 

15 10 15 

He Gly Ser Ser Gly Ser Ser Pro Ser Arg Thr Ser Pro Glu Ser Leu 

20 25 30 

Tyr Ser Asp Asn Ser Asn Gly Ser Phe Gin Ser Leu Thr Gin Gly Cys 

35 40 45 

Pro Thr Tyr Phe Pro Pro Ser Pro Thr Gly Ser Leu Thr Gin Asp Pro 

50 55 60 

Ala Arg Ser Phe Gly Ser He Pro Pro Ser Leu Ser Asp Asp Gly Ser 
65 70 75 80 

Pro Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Phe Tyr Asri 

85 90 95 

Gly Ser Pro Pro Gly Ser Leu Gin Val Ala Met Glu Asp Ser Ser Arg 

100 105 HO 

Val Ser Pro Ser Lys Ser Thr Ser Asn He Thr Lys Leu Asn Gly Met 

115 120 125 

Val Leu Leu Cys Lys Val Cys Gly Asp Val Ala Ser Gly Phe His Tyr 

130 135 140 

Gly Val Leu Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser He 
145 150 155 160 

Gin Gin Asn He Gin Tyr Lys Arg Cys Leu Lys Asn Glu Asn Cys Ser 

165 170 175 

He Val Arg He Asn Arg Asn Arg Cys Gin Gin Cys Arg Phe Lys Lys 

180 185 190 

Cys Leu Ser Val Gly Met Ser Arg Asp Ala Val Arg Phe Gly Arg lie 

195 200 205 

Pro Lys Arg Glu Lys Gin Arg Met Leu Ala Glu Met Gin Ser Ala Met 

210 ' 215 220 

Asn Leu Ala Asn Asn Gin Leu Ser Ser Gin Cys Pro Leu Glu Thr Ser 
225 230 235 240 

Pro Thr Gin His Pro Thr Pro Gly Pro Met Gly Pro Ser Pro Pro Pro 

245 250 255 

Ala Pro Val Pro Ser Pro Leu Val Gly Phe Ser Gin Phe Pro Gin Gin 

260 265 270 

Leu Thr Pro Pro Arg Ser Pro Ser Pro Glu Pro Thr Val Glu Asp Val 

275 280 285 

He Ser Gin Val Ala Arg Ala His Arg Glu He Phe Thr Tyr Ala His 

290 295 300 

Asp Lys Leu Gly Ser Ser Pro Gly Asn Phe Asn Ala Asn His Ala Ser 
305 * 310 315 320 

Gly Ser Pro Pro Ala Thr Thr Pro His Arg Trp Glu Asn Gin Gly Cys 

325 330 335 

Pro Pro Ala Pro Asn Asp Asn Asn Thr Leu Ala Ala Gin Arg His Asn 

340 345 350 

Glu Ala Leu Asn Gly Leu Arg Gin Ala Pro Ser Ser Tyr Pro Pro Thr 

355 360 365 

Trp Pro Pro Gly Pro Ala His His Ser Cys His Gin Ser Asn Ser Asn 

370 375 380 

Gly His Arg Leu Cys Pro Thr His Val Tyr Ala Ala Pro Glu Gly Lys 
385 390 395 400 

Ala Pro Ala Asn Ser Pro Arg Gin Gly Asn Ser Lys Asn Val Leu Leu 

405 410 415 

Ala Cys Pro Met Asn Met Tyr Pro His Gly Arg Ser Gly Arg Thr Val 

420 425 430 

Gin Glu He Trp Glu Asp Phe Ser Met Ser Phe Thr Pro Ala Val Arg 

435 " 440 445 

Glu Val Val Glu Phe Ala Lys His He Pro Gly Phe Arg Asp Leu Ser 

450 455 460 

Gin His Asp Gin Val Thr Leu Leu Lys Ala Gly Thr Phe Glu Val Leu 
465 470 475 480 

Met Val Arg Phe Ala Ser Leu Phe Asn Val Lys Asp Gin Thr Val Met 

485 490 495 

Phe Leu Ser Arg Thr Thr Tyr Ser Leu Gin Glu Leu Gly Ala Met Gly 
500 505 510 
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Met, Gly Asp 


Leu 


Leu 


Ser 


Ala Met Phe 


515 








520 


Ser Leu Ala 


Leu 


Thr 


Glu 


Glu Glu Leu 


530 








535 


Leu Val Ser 


Ala 


Asp 


Arcr 


Ser Glv Met 


545 






CCf\ 


Gin Leu Gin 


Glu 


Thr 


Leu 


Leu Arg Ala 






565 






Asn Arg Pro 


Leu 


Glu 


Thr 


Ser Arg Phe 




580 






585 


Pro Asp Leu 


Arg 


Thr 


Leu 


Asn Asn Met 


595 








600 


Phe Arg Val 


Asp 


Ala 


Gin 




610 










<210> 43 










<211> 703 










<212> PRT 










<213> Homo sapiens 







•53- 

Asp Phe Ser Glu Lys Leu Asn 
525 

Gly Leu Phe Thr Ala Val Val 
540 

Glu Asn Ser Ala Ser Val Glu 
555 560 
Leu Arg Ala Leu Val Leu Lys 
570 575 
Thr Lys Leu Leu Leu Lys Leu 
590 

His Ser Glu Lys Leu Leu Ser 
605 



<400> 43 

Met Ala Asp Arg Arg Arg Gin Arg Ala Ser Gin Asp Thr Glu Asp Glu 

1 5 io 15 

Glu Ser Gly Ala Ser Gly Ser Asp Ser Gly Gly Ser Pro Leu Arg Glv 

20 25 " - 30 

Gly Gly Ser Cys Ser Gly Ser Ala Gly Gly Gly Gly Ser Gly Ser Leu 

35 40 45 

Pro Ser Gin Arg Gly Gly Arg Thr Gly Ala Leu His Leu Arg Arg Val 

50 55 60 

Glu Ser Gly Gly Ala Lys Ser Ala Glu Glu Ser Glu Cys Glu Ser Glu 
65 70 75 80 

Asp Gly lie Glu Gly Asp Ala Val Leu Ser Asp Tyr Glu Ser Ala Glu 

85 90 95 

Asp Ser Glu Gly Glu Glu Gly Glu Tyr Ser Glu Glu Glu Asn Ser Lys 

100 105 no 

Val Glu Leu Lys Ser Glu Ala Asn Asp Ala Val Asn Ser Ser Thr Lys 

115 120 125 

Glu Glu Lys Gly Glu Glu Lys Pro Asp Thr Lys Ser Thr Val Thr Glv 

130 135 140 

Glu Arg Gin Ser Gly Asp Gly Gin Glu Ser Thr Glu Pro Val Glu Asn 
145 150 155 160 

Lys Val Gly Lys Lys Gly Pro Lys His Leu Asp Asp Asp Glu Asp Arg 

165 170 ~ 17 5 

Lys Asn Pro Ala Tyr lie Pro Arg Lys Gly Leu Phe Phe Glu His Asp 

180 185 190 

Leu Arg Gly Gin Thr Gin Glu Glu Glu Val Arg Pro Lys Gly Arg Gin 

195 200 ~ 205 

Arg Lys Leu Trp Lys Asp Glu Gly Arg Trp Glu His Asp Lys Phe Arg 
m 210 215 220 

Glu Asp Glu Gin Ala Pro Lys Ser Arg Gin Glu Leu He Ala Leu Tyr 
225 230 235 2 40 

Gly Tyr Asp He Arg Ser Ala His Asn Pro Asp Asp He Lys Pro Arg 

245 250 255 

Arg He Arg Lys Pro Arg Tyr Gly Ser Pro Pro Gin Arg Asp Pro Asn 

260 265 270 

Trp Asn Gly Glu Arg Leu Asn Lys Ser His Arg His Gin Gly Leu Glv 

275 280 ~ 285 

Gly Thr Leu Pro Pro Arg Thr Phe He Asn Arg Asn Ala Ala Glv Thr 

290 295 300 

Gly Arg Met Ser Ala Pro Arg Asn Tyr Ser Arg Ser Gly Gly Phe Lvs 
305 310 315 320 
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Glu Gly Arg Ala Gly Phe Arg Pro Val Glu Ala Gly Gly Gin His Gly 

325 330 335 

Gly Arg Ser Gly Glu Thr Val Lys His Glu He Ser Tyr Arg Ser Arg 

340 345 350 

Arg Leu Glu Gin Thr Ser Val Arg Asp Pro Ser Pro Glu Ala Asp Ala 

355 360 365 

Pro Val Leu Gly Ser Pro Glu Lys Glu Glu Ala Ala Ser Glu Pro Pro 

370 " 375 380 

Ala Ala Ala Pro Asp Ala Ala Pro Pro Pro Pro Asp Arg Pro He Glu 
385 390 395 400 

Lys Lys Ser Tyr Ser Arg Ala Arg Arg Thr Arg Thr Lys Val Gly Asp 

405 410 415 

Ala Val Lys Leu Ala Glu Glu Val Pro Pro Pro Pro Glu Gly Leu He 

420 425 430 

Pro Ala Pro Pro Val Pro Glu Thr Thr Pro Thr Pro Pro Thr Lys Thr 

435 440 445 

Gly Thr Trp Glu Ala Pro Val Asp Ser Ser Thr Ser Gly Leu Glu Gin 

450 *" 455 460 

Asp Val Ala Gin Leu Asn He Ala Glu Gin Asn Trp Ser Pro Gly Gin 
465 470 475 480 

Pro Ser Phe Leu Gin Pro Arg Glu Leu Arg Gly Met Pro Asn His He 

485 490 495 

His Met Gly Ala Gly Pro Pro Pro Gin Phe Asn Arg Met Glu Glu Met 

500 505 510 

Gly Val Gin Gly Gly Arg Ala Lys Arg Tyr Ser Ser Gin Arg Gin Arg 

515 520 525 

Pro Val Pro Glu Pro Pro Ala Pro Pro Val His He Ser He Met Glu 

530 535 540 

Gly His Tyr Tyr Asp Pro Leu Gin Phe Gin Gly Pro He Tyr Thr His 
545 * 550 555 560 

Gly Asp Ser Pro Ala Pro Leu Pro Pro Gin Gly Met Leu Val Gin Pro 

565 570 575 

Gly Met Asn Leu Pro His Pro Gly Leu His Pro His Gin Thr Pro Ala 

580 585 590 

Pro Leu Pro Asn Pro Gly Leu Tyr Pro Pro Pro Val Ser Met Ser Pro 

595 600 605 

Gly Gin Pro Pro Pro Gin Gin Leu Leu Ala Pro Thr Tyr Phe Ser Ala 

610 615 620 

Pro Gly Val Met Asn Phe Gly Asn Pro Ser Tyr Pro Tyr Ala Pro Gly 
625 630 635 640 

Ala Leu Pro Pro Pro Pro Pro Pro His Leu Tyr Pro Asn Thr Gin Ala 

645 650 655 

Pro Ser Gin Val Tyr Gly Gly Val Thr Tyr Tyr Asn Pro Ala Gin Gin 

660 665 670 

Gin Val Gin Pro Lys Pro Ser Pro Pro Arg Arg Thr Pro Gin Pro Val 

675 " 680 685 

Thr He Lys Pro Pro Pro Pro Glu Val Val Ser Arg Gly Ser Ser 

690 695 700 

<210> 44 

<211> 560 

<212> PRT 

<213> Homo sapiens 



<400> 44 

Met Pro Gin Thr Arg Ser Gin Ala 
1 5 
Arg Lys Leu Ser Arg Ala Leu Asn 
20 

Lys Leu Glu Pro Thr Asn Val Gin 
35 40 



Gin Ala Thr He Ser Phe Pro Lys 

10 15 
Lys Ala Lys Asn Ser Ser Asp Ala 
25 30 
Thr Val Thr Cys Ser Pro Arg Val 
45 
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Lys Ala Leu Pro Leu Ser Pro Arg Lye Arg Leu Gly Asp Asp Asn Leu 

50 55 60 

Cys Asn Thr Pro His Leu Pro Pro Cys Ser Pro Pro Lys Gin Gly Lys 
6 5 70 75 80 

Lys Glu Asn Gly Pro Pro His Ser His Thr Leu Lys Gly Arg Arg Leu 

85 90 95 

Val Phe Asp Asn Gin Leu Thr lie Lys Ser Pro Ser Lys Arg Glu Leu 

100 105 no 

Ala Lys Val His Gin Asn Lys lie Leu Ser Ser Val Arg Lys Ser Gin 

115 120 125 

Glu lie Thr Thr Asn Ser Glu Gin Arg Cys Pro Leu Lys Lys Glu Ser 

130 135 140 

Ala Cys Val Arg Leu Phe Lys Gin Glu Gly Thr Cys Tyr Gin Gin Ala 
145 150 155 160 

Lys Leu Val Leu Asn Thr Ala Val Pro Asp Arg Leu Pro Ala Arg Glu 

165 170 175 

Arg Glu Met Asp Val lie Arg Asn Phe Leu Arg Glu His He Cys Gly 

180 185 190 

Lys Lys Ala Gly Ser Leu Tyr Leu Ser Gly Ala Pro Gly Thr Gly Lys 

195 200 205 

Thr Ala Cys Leu Ser Arg He Leu Gin Asp Leu Lys Lys Glu Leu Lys 

210 215 220 

Gly Phe Lys Thr He Met Leu Asn Cys Met Ser Leu Arg Thr Ala Gin 
22 5 230 235 240 

Ala Val Phe Pro Ala He Ala Gin Glu He Cys Gin Glu Glu Val Ser 

245 250 255 

Arg Pro Ala Gly Lys Asp Met Met Arg Lys Leu Glu Lys His Met Thr 

260 265 270 

Ala Glu Lys Gly Pro Met He Val Leu Val Leu Asp Glu Met Asp Gin 

275 280 285 

Leu Asp Ser Lys Gly Gin Asp Val Leu Tyr Thr Leu Phe Glu Trp Pro 

290 295 300 

Trp Leu Ser Asn Ser His Leu Val Leu He Gly He Ala Asn Thr Leu 
305 310 315 320 

Asp Leu Thr Asp Arg He Leu Pro Arg Leu Gin Ala Arg Glu Lys Cys 

325 330 335 

Lys Pro Gin Leu Leu Asn Phe Pro Pro Tyr Thr Arg Asn Gin He Val 

340 345 350 

Thr He Leu Gin Asp Arg Leu Asn Gin Val Ser Arg Asp Gin Val Leu 

355 360 365 

Asp Asn Ala Ala Val Gin Phe Cys Ala Arg Lys Val Ser Ala Val Ser 

370 375 380 

Gly Asp Val Arg Lys Ala Leu Asp Val Cys Arg Arg Ala He Glu He 
385 390 395 400 

Val Glu Ser Asp Val Lys Ser Gin Thr He Leu Lys Pro Leu Ser Glu 

405 410 * 415 

Cys Lys Ser Pro Ser Glu Pro Leu He Pro Lys Arg Val Gly Leu He 

420 425 430 

His He Ser Gin Val He Ser Glu Val Asp Gly Asn Arg Met Thr Leu 

435 440 445 

Ser Gin Glu Gly Ala Gin Asp Ser Phe Pro Leu Gin Gin Lys He Leu 

450 455 460 

Val Cys Ser Leu Met Leu Leu He Arg Gin Leu Lys He Lys Glu Val 
465 470 475 480 

Thr Leu Gly Lys Leu Tyr Glu Ala Tyr Ser Lys Val Cys Arg Lys Gin 

485 490 495 

Gin Val Ala Ala Val Asp Gin Ser Glu Cys Leu Ser Leu Ser Gly Leu 

500 505 sio 

Leu Glu Ala Arg Gly He Leu Gly Leu Lys Arg Asn Lys Glu Thr Arg 

515 520 525 

Leu Thr Lys Val Phe Phe Lys He Glu Glu Lys Glu He Glu His Ala 

530 535 540 

Leu Lys Asp Lys Ala Leu He Gly Asn He Leu Ala Thr Gly Leu Pro 
545 550 555 560 

<210> 45 
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<211> 462 
<212> PRT 
<213> Homo sapiens 



<400> 45 

Met Ala Ser Asn Ser Ser Ser Cys Pro Thr Pro Gly Gly Gly His Leu 

15 10 15 

Asn Gly Tyr Pro Val Pro Pro Tyr Ala Phe Phe Phe Pro Pro Met Leu 

20 25 30 

Gly Gly Leu Ser Pro Pro Gly Ala Leu Thr Thr Leu Gin His Gin Leu 

35 40 45 

Pro Val Ser- Gly Tyr Ser Thr Pro- Ser Pro Ala Thr lie Glu Thr Gin 

50 55 60 

Ser Ser Ser Ser Glu Glu lie Val Pro Ser Pro Pro Ser Pro Pro Pro 
65 70 75 80 

Leu Pro Arg He Tyr Lys Pro Cys Phe Val Cys Gin Asp Lys Ser Ser 

85 90 95 

Gly Tyr His Tyr Gly Val Ser Ala Cys Glu Gly Cys Lys Gly Phe Phe 

100 105 HO 

Arg Arg Ser He Gin Lys Asn Met Val Tyr Thr Cys His Arg Asp Lys 

115 120 125 

Asn Cys He He Asn Lys Val Thr Arg Asn Arg Cys Gin Tyr Cys Arg 

. 130 - • - 135 140 

Leu Gin Lys Cys Phe Glu Val Gly Met Ser Lys Glu Ser Val Arg Asn 
145 J 150 155 160 

Asp Arg Asn Lys Lys Lys Lys Glu Val Pro Lys Pro Glu Cys Ser Glu 

165 170 175 

Ser Tyr Thr Leu Thr Pro Glu Val Gly Glu Leu He Glu Lys Val Arg 

180 185 190 

Lys Ala His Gin Glu Thr Phe Pro Ala Leu Cys Gin Leu Gly Lys Tyr 

195 200 205 

Thr Thr Asn Asn Ser Ser Glu Gin Arg Val Ser Leu Asp He Asp Leu 

210 215 220 

Trp Asp Lys Phe Ser Glu Leu Ser Thr Lys Cys He He Lys Thr Val 
225 230 235 240 

Glu Phe Ala Lys Gin Leu Pro Gly Phe Thr Thr Leu Thr He Ala Asp 

245 250 255 

Gin He Thr Leu Leu Lys Ala Ala Cys Leu Asp He Leu He Leu Arg 

260 265 270 

He Cys Thr Arg Tyr Thr Pro Glu Gin Asp Thr Met Thr Phe Ser Asp 

275 280 285 

Gly Leu Thr Leu Asn Arg Thr Gin Met His Asn Ala Gly Phe Gly Pro 

290 295 300 

Leu Thr Asp Leu Val Phe Ala Phe Ala Asn Gin Leu Leu Pro Leu Glu 
305 310 315 320 

Met Asp Asp Ala Glu Thr Gly Leu Leu Ser Ala He Cys Leu He Cys 

325 330 335 

Gly Asp Arg Gin Asp Leu Glu Gin Pro Asp Arg Val Asp Met Leu Gin 

340 345 350 

Glu Pro Leu Leu Glu Ala Leu Lys Val Tyr Val Arg Lys Arg Arg Pro 

355 360 365 

Ser Arg Pro His Met Phe Pro Lys Met Leu Met Lys He Thr Asp Leu 

370 375 380 

Arg Ser He Ser Ala Lys Gly Ala Glu Arg Val He Thr Leu Lys Met 
385 390 395 400 

Glu He Pro Gly Ser Met Pro Pro Leu He Gin Glu Met Leu Glu Asn 

405 410 415 

Ser Glu Gly Leu Asp Thr Leu Ser Gly Gin Pro Gly Gly Gly Gly Arg 

420 425 430 

Asp Gly Gly Gly Leu Ala Pro Pro Pro Gly Ser Cys Ser Pro Ser Leu 
435 440 445 
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Ser Pro Ser Ser Asn Arg Ser Ser Pro Ala Thr His Ser Pro 

450 455 460 

<210> 46 

<211> 1531 

<212> PRT 

<213> Homo sapiens 



<400> 46 

Met Glu Val Ser Pro Leu Gin Pro Val Asn Glu Asn Met Gin Val Asn 

15 10 15 

Lys lie Lys Lys Asn Glu Asp Ala Lys Lys Arg Leu Ser Val Glu Arg 

20 25 30 

lie Tyr Gin Lys Lys Thr Gin Leu Glu His He Leu Leu Arg Pro Asp 

35 40 45 

Thr Tyr He Gly Ser Val Glu Leu Val Thr Gin Gin Met Trp Val Tyr 

50 55 60 

Asp Glu Asp Val Gly He Asn Tyr Arg Glu Val Thr Phe Val Pro Gly 
65 70 75 80 

Leu Tyr Lys He Phe Asp Glu He Leu Val Asn Ala Ala Asp Asn Lys 

85 90 95 

Gin Arg Asp Pro Lys Met Ser Cys He Arg Val Thr He Asp Pro Glu 

100 105 110 

Asn Asn Leu He Ser He Trp Asn Asn Gly Lys Gly He Pro Val Val 

115 120 125 

Glu His Lys Val Glu Lys Met Tyr Val Pro Ala Leu He Phe Gly Gin 

130 135 140 

Leu Leu Thr Ser Ser Asn Tyr Asp Asp Asp Glu Lys Lys Val Thr Gly 
145 150 155 160 

Gly Arg Asn Gly Tyr Gly Ala Lys Leu Cys Asn He Phe Ser Thr Lys 

165 170 175 

Phe Thr Val Glu Thr Ala Ser Arg Glu Tyr Lys Lys Met Phe Lys Gin 

180 185 190 

Thr Trp Met Asp Asn Met Gly Arg Ala Gly Glu Met Glu Leu Lys Pro 

195 200 205 

Phe Asn Gly Glu Asp Tyr Thr Cys He Thr Phe Gin Pro Asp Leu Ser 

210 * ~ 215 220 

Lys Phe Lys Met Gin Ser Leu Asp Lys Asp He Val Ala Leu Met Val 
225 230 235 240 

Arg Arg Ala Tyr Asp He Ala Gly Ser Thr Lys Asp Val Lys Val Phe 

245 250 255 

Leu Asn Gly Asn Lys Leu Pro Val Lys Gly Phe Arg Ser Tyr Val Asp 

260 265 270 

Met Tyr Leu Lys Asp Lys Leu Asp Glu Thr Gly Asn Ser Leu Lys Val 

275 280 285 

He His Glu Gin Val Asn His Arg Trp Glu Val Cys Leu Thr Met Ser 

290 295 300 

Glu Lys Gly Phe Gin Gin He Ser Phe Val Asn Ser He Ala Thr Ser 
305 310 315 320 

Lys Gly Gly Arg His Val Asp Tyr Val Ala Asp Gin He Val Thr Lys 

325 330 335 

Leu Val Asp Val Val Lys Lys Lys Asn Lys Gly Gly Val Ala Val Lys 

340 345 350 

Ala His Gin Val Lys Asn His Met Trp He Phe Val Asn Ala Leu He 

355 360 365 

Glu Asn Pro Thr Phe Asp Ser Gin Thr Lys Glu Asn Met Thr Leu Gin 

370 375 380 

Pro Lys Ser Phe Gly Ser Thr Cys Gin Leu Ser Glu Lys Phe He Lys 
385 390 395 400 

Ala Ala He Gly Cys Gly He Val Glu Ser He Leu Asn Trp Val Lys 
405 41 ° 415 
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Phe Lys Ala Gin Val Gin Leu Asn Lys Lys Cys Ser Ala Val Lys His 

420 425 430 

Asn Arg He Lys Gly He Pro Lys Leu Asp Asp Ala Asn Asp Ala Gly 

435 440 445 

Gly Arg Asn Ser Thr Glu Cys Thr Leu He Leu Thr Glu Gly Asp Ser 

450 455 460 

Ala Lys Thr Leu Ala Val Ser Gly Leu Gly Val Val Gly Arg Asp Lys 
465 470 475 480 

Tyr Gly Val Phe Pro Leu Arg Gly Lys He Leu Asn Val Arg Glu Ala 

485 490 495 

Ser His Lys Gin He Met Glu Asn Ala Glu He Asn Asn He He Lys 

500 505 510 

He Val Gly Leu Gin Tyr Lys Lys Asn Tyr Glu Asp Glu Asp Ser Leu 

515 520 525 

Lys Thr Leu Arg Tyr Gly Lys He Met He Met Thr Asp Gin Asp Gin 

530 535 540 

Asp Gly Ser His He Lys Gly Leu Leu He Asn Phe He His His Asn 
545 550 555 560 

Trp Pro Ser Leu Leu Arg His Arg Phe Leu Glu Glu Phe He Thr Pro 

565 570 575 

He Val Lys Val Ser Lys Asn Lys Gin Glu Met Ala Phe Tyr Ser Leu 

580 585 590 

Pro Glu Phe Glu Glu Trp Lys Ser Ser Thr Pro Asn His Lys Lys Trp 

595 600 605 

Lys Val Lys Tyr Tyr Lys Gly Leu Gly Thr Ser Thr Ser Lys Glu Ala 

610 615 620 

Lys Glu. Tyr Phe Ala Asp Met Lys Arg His Arg He Gin Phe Lys Tyr 
625 630 635 640 

Ser Gly Pro Glu Asp Asp Ala Ala He Ser Leu Ala Phe Ser Lys Lys 

645 650 655 

Gin He Asp Asp Arg Lys Glu Trp Leu Thr Asn Phe Met Glu Asp Arg 

660 665 670 

Arg Gin Arg Lys Leu Leu Gly Leu Pro Glu Asp Tyr Leu Tyr Gly Gin 

675 680 685 

Thr Thr Thr Tyr Leu Thr Tyr Asn Asp Phe He Asn Lys Glu Leu He 

690 695 700 

Leu Phe Ser Asn Ser Asp Asn Glu Arg Ser He Pro Ser Met Val Asp 
705 710 715 720 

Gly Leu Lys Pro Gly Gin Arg Lys Val Leu Phe Thr Cys Phe Lys Arg 

725 730 735 

Asn Asp Lys Arg Glu Val Lys Val Ala Gin Leu Ala Gly Ser Val Ala 

740 745 750 

Glu Met Ser Ser Tyr His His Gly Glu Met Ser Leu Met Met Thr He 

755 760 765 

He Asn Leu Ala Gin Asn Phe Val Gly Ser Asn Asn Leu Asn Leu Leu 

770 775 780 

Gin Pro He Gly Gin Phe Gly Thr Arg Leu His Gly Gly Lys Asp Ser 
785 790 795 * 800 

Ala ser Pro Arg Tyr He Phe Thr Met Leu Ser Ser Leu Ala Arg Leu 

805 810 815 

Leu Phe Pro Pro Lys Asp Asp His Thr Leu Lys Phe Leu Tyr Asp Asp 

820 825 830 

Asn Gin Arg Val Glu Pro Glu Trp Tyr He Pro He He Pro Met Val 

835 840 845 

Leu He Asn Gly Ala Glu Gly He Gly Thr Gly Trp Ser Cys Lys He 

850 855 860 

Pro Asn Phe Asp Val Arg Glu He Val Asn Asn He Arg Arg Leu Met 
865 870 875 880 

Asp Gly Glu Glu Pro Leu Pro Met Leu Pro Ser Tyr Lys Asn Phe Lys 

885 890 " 895 

Gly Thr He Glu Glu Leu Ala Pro Asn Gin Tyr Val He Ser Gly Glu 

900 905 910 

Val Ala He Leu Asn Ser Thr Thr He Glu He Ser Glu Leu Pro Val 

915 920 925 

Arg Thr Trp Thr Gin Thr Tyr Lys Glu Gin Val Leu Glu Pro Met Leu 
930 935 94 0 
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Asn Gly Thr Glu Lys Thr Pro Pro Leu He Thr Asp Tyr Arg Glu Tyr 
9*5 950 955 ^ 960 

His Thr Asp Thr Thr Val Lys Phe Val Val Lys Met Thr Glu Glu Lys 

965 970 975 

Leu Ala Glu Ala Glu Arg Val Gly Leu His Lys Val Phe Lys Leu Gin 

980 985 * 990 

Thr Ser Leu Thr Cys Asn Ser Met Val Leu Phe Asp His Val Gly Cys 
995 1000 1005 



Leu 


Lys 


Lys 


Tyr 


Asp 


Thr 


Val 


Leu Asp 


He 


Leu 


Arg 


Asp 


Phe 


Phe 




1010 










1015 










1020 






Glu 


Leu 


Arg 


Leu 


Lys 


Tyr 


Tyr 


Gly Leu 


Arg Lys 


Glu 


Trp 


Leu 


Leu 




1025 










1030 










1035 






Gly 


Met 


Leu 


Gly 


Ala 


Glu 


Ser 


Ala 


Lys 


Leu 


Asn 


Asn 


Gin Ala Arg 




1040 










1045 










1050 








Phe 


He 


Leu 


Glu 


Lys 


He 


Asp 


Gly Lys 


He 


He 


He 


Glu 


Asn 


Lys 




1055 










1060 










1065 






Pro 


Lys 


Lys 


Glu 


Leu 


He 


Lys 


V Ct J. 


itcu 


He Gin Arg 


Gly 


Tyr 


Asp 




1070 










1075 










i nan 

1UOU 


Ser 


Asp 


Pro 


Val 


Lys 


Ala 


Trp 


Lys 


Glu 


Ala 


Gin 


/••I <■-» 


Lys 


Val 


Pro 




1085 










1090 










i not: 






Asp 


Glu 
1100 


Glu 


Glu 


Asn 


Glu 


Glu 
1105 


Ser 


Asp 


Asn 


Glu 


T « wr* 

uys 

1JL1U 


Glu 


Thr 


Glu 


Lys 


Ser 


Asp 


Ser 


Val 


Thr 


Asp 


Ser 


Gly 


Pro 


Thr 


Pne 


Asn 


Tyr 


Leu 




1115 










1120 










JL±ZZ> 






Leu 


Asp 


Met 


Pro 


Leu 


Trp 


Tyr 


Leu 


Thr 


Lys 


Glu 


Jjys 


Lys 


Asp 


Glu 




1130 










1135 










1140 




Leu 


Cys 


Arg 


Leu 


Arg 


Asn 


Glu 


Lys 


Glu 


Gin 


Glu 


Leu 


Asp 


Thr 


Leu 




1145 










1150 










1155 






Lys 


Arg 
1160 


Lys 


Ser 


Pro 


Ser 


Asp 
1165 


Leu 


Trp 


Lys 


Glu 


Asp 
1170 


Leu 


Ala 


Thr 


Phe 


He 


Glu 


Glu 


Leu 


Glu 


Ala 


Val 


Glu 


Ala 


Lys 


Glu 


Lys 


Gin 


Asp 




1175 










1180 










1185 




Glu 


Gin 


Val 


Gly 


Leu 


Pro 


Gly 


Lys 


Gly 


Gly Lys 


Ala 


Lys 


Gly Lys 




1190 










1195 










1200 






Lys 


Thr 


Gin 


Met 


Ala 


Glu 


Val 


Leu 


Pro 


Ser 


Pro 


Arg 


Gly Gin Arg 




1205 










1210 










1215 








Val 


He 


Pro 


Arg 


He 


Thr 


He 


Glu 


Met 


Lys 


Ala 


Glu 


Ala 


Glu 


Lys 




1220 










1225 










1230 






Lys 


Asn 


Lys 


Lys 


Lys 


He 


Lys 


Asn 


Glu 


Asn 


Thr 


Glu 


Gly 


Ser 


Pro 




1235 










1240 










1245 






Gin 


Glu 


Asp 


Gly 


Val 


Glu 


Leu 


Glu Gly 


Leu 


Lys 


Gin 


Arg 


Leu 


Glu 




1250 










1255 










1260 






Lys 


Lys 


Gin 


Lys 


Arg 


Glu 


Pro 


Gly Thr 


Lys 


Thr 


Lys 


Lys 


Gin 


Thr 




1265 










1270 










1275 






Thr 


Leu 


Ala 


Phe 


Lys 


Pro 


He 


Lys 


Lys 


Gly Lys 


Lys 


Arg Asn 


Pro 




1280 










1285 










1290 








Trp 


Ser 


Asp 


Ser 


Glu 


Ser 


Asp 


Arg 


Ser 


Ser Asp 


Glu 


Ser 


Asn 


Phe 




1295 










1300 










1305 








Asp 


Val 


Pro 


Pro 


Arg 


Glu 


Thr 


Glu 


Pro 


Arg Arg 


Ala 


Ala 


Thr 


Lys 




1310 










1315 










1320 






Thr 


Lys 


Phe 


Thr 


Met 


Asp 


Leu 


Asp 


Ser 


Asp Glu Asp 


Phe 


Ser Asp 


Phe 


1325 










1330 










1335 








Asp 


Glu 


Lys 


Thr 


Asp 


Asp 


Glu Asp 


Phe 


Val 


Pro 


Ser 


Asp 


Ala 




1340 










1345 










1350 






Ser 


Pro 


Pro 


Lys 


Thr 


Lys 


Thr 


Ser 


Pro 


Lys 


Leu 


Ser 


Asn 


Lys 


Glu 




1355 










1360 








1365 






Leu 


Lys 


Pro 


Gin 


Lys 


Ser 


Val 


Val 


Ser 


Asp 


Leu 


Glu 


Ala 


Asp 


Asp 




1370 










1375 








1380 




Val 


Lys 
1385 


Gly 


Ser 


Val 


Pro 


Leu 
1390 


Ser 


Ser 


Ser 


Pro 


Pro 
1395 


Ala 


Thr 


His 


Phe 


Pro 


Asp 


Glu 


Thr 


Glu 


He 


Thr 


Asn 


Pro 


Val 


Pro 


Lys 


Lys 


Asn 




1400 










1405 










1410 




Val 


Thr 
1415 


Val 


Lys 


Lys 


Thr 


Ala 
1420 


Ala 


Lys 


Ser 


Gin 


Ser 
1425 


Ser 


Thr 


Ser 


Thr 


Thr 


Gly 


Ala 


Lys 


Lys 


Arg 


Ala 


Ala 


Pro 


Lys 


Gly 


Thr 


Lys 


Arg 




1430 










1435 








1440 
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ASP 


Pro 


Ala 


Leu 


Asn 


Ser 


Glv 


Val Ser 


Gin 


Lvs 


Pro 


Asp 


Pro 


Ala 




1445 










1450 








1455 








Lvs 


Thr 


LVS 


Asn 


Arg 


Arg 


Lvs 


Arg Lys 


Pro 


Ser 


Thr 


Ser 


Asp Asp 




1460 










1465 








1470 








Ser 


Asp 


Ser 


Asn 


Phe 


Glu 


Lys 


He Val 


Ser 


Lys 


Ala 


Val 


Thr 


Ser 




1475 










1480 








1485 








Lys 


Lys 


Ser 


Lys 


Gly Glu 


Ser 


Asp Asp 


Phe 


His 


Met 


Asp 


Phe 


Asp 




1490 










1495 








1500 








Ser 


Ala 


Val 


Ala 


Pro 


Arg 


Ala 


Lys Ser 


val 


Arg 


Ala 


Lys 


Lys 


Pro 




1505 










1510 








1515 








lie 


Lys 


Tyr 


Leu 


Glu 


Glu 


Ser 


Asp Glu Asp Asp 


Leu 


Phe 








1520 










1525 








1530 









<210> 47 



<211> 258 
<212> PRT 
<213> Homo sapiens 



<400> 47 



Met 


Leu 


Pro 


Leu 


Cys Leu 


Val Ala 


Ala 


Leu 


Leu Leu Ala Ala Gly 


Pro 


1 








5 






10 






15 




Gly 


Pro 


Ser 


Leu 


Gly Asp 


Glu Ala 


He 


His 


Cys 


Pro 


Pro Cys Ser 


Glu 








20 






25- 








30 




Glu 


Lys 


Leu 


Ala 


Arg Cys 


Arg Pro 


Pro 


Val 


Gly 


Cys 


Glu Glu Leu 


Val 






35 






40 










45 




Arg 


Glu 


Pro 


Gly 


Cys Gly 


Cys Cys 


Ala 


Thr 


Cys 


Ala 


Leu Gly Leu 


Gly 




50 








55 








60 






Met 


Pro 


Cys 


Gly 


Val Tyr 


Thr Pro 


Arg 


Cys 


Gly 


Ser 


Gly Leu Arg 


Cys 


65 








70 








75 






80 


Tyr 


Pro 


Pro Arg 


Gly Val 


Glu Lys 


Pro 


Leu 


His 


Thr 


Leu Met His 


Gly 










85 






90 






95 




Gin 


Gly 


Val 


Cys 


Met Glu 


Leu Ala 


Glu 


He 


Glu 


Ala 


He Gin Glu 


Ser 








100 






105 








110 




Leu 


Gin 


Pro 


Ser 


Asp Lys Asp Glu Gly Asp 


His 


Pro 


Asn Asn Ser 


Phe 






115 






120 










125 




Ser 


Pro 


Cys 


Ser 


Ala His 


Asp Arg Arg 


Cys 


Leu 


Gin 


Lys His Phe 


Ala 




130 








135 








14 0 






Lys 


He 


Arg Asp 


Arg Ser Thr Ser Gly Gly 


Lys 


Met 


Lys Val Asn Gly 


145 








150 








155 






160 


Ala 


Pro 


Arg 


Glu 


Asp Ala Arg Pro Val 


Pro 


Gin Gly 


Ser Cys Gin 


Ser 










165 






170 






175 




Glu 


Leu 


His 


Arg 


Ala Leu 


Glu Arg 


Leu 


Ala 


Ala 


Ser 


Gin Ser Arg 


Thr 








180 






185 








190 




His 


Glu 


Asp 


Leu 


Tyr He 


He Pro 


He 


Pro 


Asn 


Cys 


Asp Arg Asn Gly 






195 






200 










205 




Asn 


Phe 


His 


Pro 


Lys Gin 


Cys His 


Pro 


Ala 


Leu 


Asp 


Gly Gin Arg Gly 




210 








215 








220 






Lys 


Cys 


Trp 


Cys 


Val Asp 


Arg Lys 


Thr Gly 


Val 


Lys 


Leu Pro Gly Gly 


225 








230 








235 






240 


Leu 


Glu 


Pro 


Lys 


Gly Glu Leu Asp 


Cys 


His 


Gin Leu Ala Asp Ser 


Phe 










245 






250 






255 





Arg Glu 



<210> 48 

<211> 378 

<212> PRT 

<213> Homo sapiens 
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<400> 48 

Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu 

15 10 15 

Val He Phe Gin Val Cys Leu Cys Gin Asp Glu Val Thr Asp Asp Tyr 

20 25 30 

He Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys 

35 40 45 

Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro He Met 

50 " 55 60 

Tyr Ser He He Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val 
65 70 75 80 

Leu Thr Tyr He Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr 

85 90 95 

Leu Leu Asn Leu Ala Val Ala Asp He Leu Phe Leu Leu Thr Leu Pro 

100 105 110 

Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe 

115 120 125 

Cys Lys Leu He Phe Ala He Tyr Lys Met Ser Phe Phe Ser Gly Met 

130 135 140 

Leu Leu Leu Leu Cys He Ser He Asp Arg Tyr Val Ala He Val Gin 
145 150 155 160 

Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu He Ser Lys 

165 170 175 

Leu Ser Cys Val Gly He Trp He Leu Ala Thr Val Leu Ser He Pro 

180 185 190 

Glu Leu Leu Tyr Ser Asp Leu Gin Arg Ser Ser Ser Glu Gin Ala Met 

195 200 205 

Arg Cys Ser Leu He Thr Glu His Val Glu Ala Phe He Thr He Gin 

210 215 220 

Val Ala Gin Met Val He Gly Phe Leu Val Pro Leu Leu Ala Met Ser 
225 230 235 240 

Phe Cys Tyr Leu Val He He Arg Thr Leu Leu Gin Ala Arg Asn Phe 

245 250 255 

Glu Arg Asn Lys Ala lie Lys Val He He Ala Val Val Val Val Phe 

260 265 270 

He Val Phe Gin Leu Pro Tyr Asn Gly Val Val Leu Ala Gin Thr Val 

275 280 285 

Ala Asn Phe Asn He Thr Ser Ser Thr Cys Glu Leu Ser Lys Gin Leu 

290 295 300 

Asn He Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys 
305 310 315 320 

Val Asn Pro Phe Leu Tyr Ala Phe He Gly Val Lys Phe Arg Asn Asp 

325 " 330 335 

Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gin Glu Gin Leu 

340 345 350 

Arg Gin Trp Ser Ser Cys Arg His He Arg Arg Ser Ser Met Ser Val 

355 360 365 

Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro 

370 375 
<210> 49 

<211> 411 

<212> PRT 

<213> Homo sapiens 



<400> 49 

Met Ser Lys Arg Pro Ser Tyr Ala 
1 5 
Thr Gin Met Pro Ser Thr Pro Gly 
20 



Pro Pro Pro Thr Pro Ala Pro Ala 

10 15 
Phe Val Gly Tyr Asn Pro Tyr Ser 
25 " 30 
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His 


Leu 


Ala Tyr Asn 


Asn 


Tyr 


Arg 






35 






40 


Ser 


Arg 


Val Thr Ala 


Ser 


Ser 


Gly 




50 






55 


Pro 


Pro 


Asp Lys Pro 


Leu 


Met 


Pro 


65 






70 






Trp 


Asp 


Gin Val Lys 


Ala 


Ser 


Asn 






85 








Gly 


Lys 


He He Gly 


Gly 


Met 


Trp 






100 








Gin 


Glu 


Tyr Leu Asn 


Glu 


Tyr 


Glu 






115 






120 


Ser 


Met 


Lys Ala Tyr 


His 


Asn 


Ser 




130 






135 




Ala 


Lys 


Ser Arg Ala 


Glu 


Ala 


Ala 


145 






150 






Gin 


Ser 


Arg Met Glu 


Lys 


Gly 


Glu 






165 








Glu 


Asp 


Pro Asp Asp 


Tyr 


Asp 


Asp 






180 








Thr 


Ala 


Arg Phe Gin 


Arg 


Asn 


His 






195 






200 


Glu 


Ser 


Val Val Pro 


Asp 


Val 


Arg 




210 






215 




Gin 


Val 


Leu Lys Arg 


Gin 


Val 


Gin 


225 






230 






Leu 


Glu 


Ala Glu Leu 


Leu 


Gin 


He 






245 








Arg 


Lys 


Phe Leu Glu 


Ser 


Thr 


Asp 






260 






Leu 


Cys 


Gly Leu Lys 


Val 


Glu 


Val 






275 






280 


He 


Ala 


Gin Ala Glu 


Glu 


Gin 


Ala 




290 






295 




Lys 


Glu 


Ala Ala Glu 


Gin 


Ala 


Glu 


305 






310 






Glu 


Glu 


Glu Gin Ala 


Ala 


Asn 


Lys 






325 






Asn 


He 


Pro Met Glu 


Thr 


Glu 


Glu 






340 








Ser 


Gin 


Gin Asn Gly 


Glu 


Glu 


Gly 






355 






360 


Ser 


Gly 


Gin Glu Gly 


Val 


Asp 


Ser 




370 






375 




Ser 


Asn 


Thr Gly Ser 


Glu 


Ser 


Asn 


385 






390 






Thr 


Asp 


Pro He Pro 


Glu 


Asp 


Glu 






405 
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Leu 


Gly 


Gly 


Asn 


Pro 


Ser 


Thr 


Asn 










45 








He 


Thr 


He 


Pro 


Lys 


Pro 


Pro 


Lys 








60 










Tyr 


Met 


Arg 


Tyr 


Ser 


Arg 


Lys 


Val 






75 










80 


Pro 


Asp 


Leu 


Lys 


Leu 


Trp 


Glu 


He 




90 










95 




Arg 


Asp 


Leu 


Thr 


Asp 


Glu 


Glu 


Lys 


105 










110 






Ala 


Glu 


Lys 


He 


Glu 


Tyr 


Asn 


Glu 










125 








Pro 


Ala 


Tyr 


Leu 


Ala 


Tyr 


He 


Asn 








140 










Leu 


Glu 


Glu 


Glu 


Ser 


Arq 


Gin 


Arq 






155 










160 


Pro 


Tyr 


Met 


Ser 


He 


Gin 


Pro 


Ala 




170 










175 




Gly 


Phe 


Ser 


Met 


Lys 


His 


Thr 


Ala 


185 










190 






Arg 


Leu 


He 


Ser 


Glu 


He 


Leu 


Ser 










205 








Ser 


Val 


Val 


Thr 


Thr 


Ala 


Arq 


Met 








220 








Ser 


Leu 


Met 


Val 


His 


Gin 


Arq 


Lvs 






235 










240 


Glu 


Glu 


Arq 


His 


Gin 


Glu. 


LVS 


Lvs 




250 










255 




Ser 


Phe 


Asn 


Asn 


Glu 


Leu 


Lvs 


Arq 


265 










270 






Asp 


Met 


Glu 


Lvs 


He 


Ala 


Ala 


Glu 










285 








Arq 


LVS 


Arq 


Gin 


Glu 


Glu 


Ara 


Glu 








300 










Arg 


Ser 


Gin 


Ser 


Ser 


He 


Val 


Pro 






315 










320 


Gly 


Glu 


Glu 


Lvs 


Lvs 


ASD 


Asr> 


Glu 




330 










335 




Thr 


His 


Leu 


Glu 


Glu 


Thr 


Thr 


Glu 


345 










350 






Thr 


Ser 


Thr 


Pro 


Glu 


Asp 


Lys 


Glu 










365 








Met 


Ala 


Glu 


Glu 


Gly 


Thr 


Ser 


Asp 








380 










Ser 


Ala 


Thr 


Val 


Glu 


Glu 


Pro 


Pro 






395 










400 


Lys 


Lys 


Glu 













410 



<212> PRT 

<213> Homo sapiens 



<400> 50 



Met Ser Val Arg Tyr 


Ser 


Ser 


Ser 


Lys 


His 


Tyr 


Ser 


Ser 


Ser 


Arg 


Ser 


1 5 










10 










15 




Gly Gly Gly Gly Gly 


Gly 


Gly 


Gly 


Cys 


Gly 


Gly 


Gly 


Gly 


Gly 


Val 


Ser 


20 








25 








30 






Ser Leu Arg He Ser 


Ser 


Ser 


Lys 


Gly 


Ser 


Leu 


Gly 


Gly 


Gly 


Phe 


Ser 



35 40 45 
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Ser Gly Gly Phe Ser Gly Gly Ser Phe Ser Arg Gly Ser Ser Gly Glv 

SO 55 60 

Gly Cys Phe Gly Gly Ser Ser Gly Gly Tyr Gly Gly Leu Gly Gly Phe 

^® 75 on 

Gly Gly Gly Ser Phe His Gly Ser Tyr Gly Ser Ser Ser Phe Gly Glv 

85 90 
Ser Tyr Gly Gly Ser Phe Gly Gly Gly Asn Phe Gly Gly Gly Ser Phe 

100 105 no 

Gly Gly Gly Ser Phe Gly Gly Gly Gly Phe Gly Gly Gly Gly Phe Gly 

115 120 125 

Gly Gly Phe Gly Gly Gly Phe Gly Gly Asp Gly Gly Leu Leu Ser Gly 
130 135 140 Y 

Asn Glu Lys Val Thr Met Gin Asn Leu Asn Asp Arg Leu Ala Ser Tvr 
145 150 155 iso 

Leu Asp Lys Val Arg Ala Leu Glu Glu Ser Asn Tyr Glu Leu Glu Gly 

165 170 i 75 

Lys lie Lys Glu Trp Tyr Glu Lys His Gly Asn Ser His Gin Gly Glu 

l ao 185 iso 

Pro Arg Asp Tyr Ser Lys Tyr Tyr Lys Thr lie Asp Asp Leu Lys Asn 

195 200 205 

Gin He Leu Asn Leu Thr Thr Asp Asn Ala Asn He Leu Leu Gin He 

210 215 220 

Asp Asn Ala Arg Leu Ala Ala Asp Asp Phe Arg Leu Lys Tyr Glu Asn 

230 235 240 

Glu Val Ala Leu Arg Gin Ser Val Glu Ala Asp He Asn Gly Leu Arq 

245 250 255 

Arg Val Leu Asp Glu Leu Thr Leu Thr Lys Ala Asp Leu Glu Met Gin 

260 265 270 

He Glu Ser Leu Thr Glu Glu Leu Ala Tyr Leu Lys Lys Asn His Glu 

275 280 285 

Glu Glu Met Lys Asp Leu Arg Asn Val Ser Thr Gly Asp Val Asn Val 

290 295 300 

Glu Met Asn Ala Ala Pro Gly Val Asp Leu Thr Gin Leu Leu Asn Asn 

310 315 320 

Met Arg Ser Gin Tyr Glu Gin Leu Ala Glu Gin Asn Arg Lys Asp Ala 

325 330 335 

Glu Ala Trp Phe Asn Glu Lys Ser Lys Glu Leu Thr Thr Glu He Asn 
340 345 350 * 

Asn Asn He Glu Gin He Ser Ser Tyr Lys Ser Glu He Thr Glu Leu 

355 360 365 

Arg Arg Asn Val Gin Ala Leu Glu He Glu Leu Gin Ser Gin Leu Ala 

370 375 380 

Leu Lys Gin Ser Leu Glu Ala Ser Leu Ala Glu Thr Glu Gly Arq Tvr 

5 390 395 3 * 

Cys Val Gin Leu Ser Gin He His Ala Gin He Ser Ala Leu Glu Glu 

405 41° 415 

Gin Leu Gin Gin He Arg Ala Glu Thr Glu Cys Gin Asn Thr Glu Tyr 

420 425 430 

Gin Gin Leu Leu Asp He Lys He Arg Leu Glu Asn Glu He Gin Thr 
435 440 445 

^f? Ser L6U LeU Glu Gly Glu Qlv Ser Ser Gly Gly Gly Gly Arq 
450 455 460 

Gly Gly Gly Ser Phe Gly Gly Gly Tyr Gly Gly Gly Ser Ser Gly Gly 

Gly Ser Ser Gly Gly Gly Tyr Gly Gly Gly His Gly Gly Ser Ser Gly 
485 490 495 * 

Gly Gly Tyr Gly Gly Gly Ser Ser Gly Gly Gly Ser Ser Gly Gly Gly 

500 505 510 

Tyr Gly Gly Gly Ser Ser Ser Gly Gly His Gly Qly Gly Ser Ser Ser 

Gly Gly His Gly Gly Ser Ser Ser Gly Gly Tyr Gly Gly Gly Ser Ser 

535 540 
Gly Gly Gly Gly Gly Gly Tyr Gly Gly Gly Ser Ser Gly Gly Gly Ser 

550 555 560 

Ser Ser Gly Gly Gly Tyr Gly Gly Gly Ser Ser Ser Gly Gly His Lys 
565 570 575 
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Ser Ser Ser Ser Gly Ser Val Gly 
580 

Tyr 

<210> 51 

<211> 494 

<212> PRT 

<213> Homo sapiens 
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Glu Ser Ser Ser Lys Gly Pro Arg 
585 590 



<400> 51 

Met Asp Leu Ser Asn Asn Thr Met Ser Leu Ser Val Arg Thr Pro Gly 

15 10 15 

Leu Ser Arg Arg Leu Ser Ser Gin Ser Val lie Gly Arg Pro Arg Gly 

20 25 30 

Met Ser Ala Ser Ser Val Gly Ser Gly Tyr Gly Gly Ser Ala Phe Gly 

35 40 45 

Phe Gly Ala Ser Cys Gly Gly Gly Phe Ser Ala Ala Ser Met Phe Gly 

50 55 60 

Ser Ser Ser Gly Phe Gly Gly Gly Ser Gly Ser Ser Met Ala Gly Gly 
65 70 75 80 

Leu Gly Ala Gly Tyr Gly Arg Ala Leu Gly Gly Gly Ser Phe Gly Gly 

85 90 95 

Leu Gly Met Gly Phe Gly Gly Ser Pro Gly Gly Gly Ser Leu Gly lie 

100 105 HO 

Leu Ser Gly Asn Asp Gly Gly Leu Leu Ser Gly Ser Glu Lys Glu Thr 

115 120 125 

Met Gin Asn Leu Asn Asp Arg Leu Ala Ser Tyr Leu Asp Lys Val Arg 

130 135 140 

Ala Leu Glu Glu Ala Asn Thr Glu Leu Glu Asn Lys He Arg Glu Trp 
145 150 155 160 

Tyr Glu Thr Arg Gly Thr Gly Thr Ala Asp Ala Ser Gin Ser Asp Tyr 

165 170 175 

Ser Lys Tyr Tyr Pro Leu He Glu Asp Leu Arg Asn Lys He He Ser 

180 185 190 

Ala Ser He Gly Asn Ala Gin Leu Leu Leu Gin He Asp Asn Ala Arg 

195 200 205 

Leu Ala Ala Glu Asp Phe Arg Met Lys Tyr Glu Asn Glu Leu Ala Leu 

210 215 220 

Arg Gin Gly Val Glu Ala Asp He Asn Gly Leu Arg Arg Val Leu Asp 
225 230 235 240 

Glu Leu Thr Leu Thr Arg Thr Asp Leu Glu Met Gin He Glu Ser Leu 

245 ~* 250 255 

Asn Rl" Glu Leu Ala Tvr Met Lys Lys Asn His Glu Asp Glu Leu Gin 

260 265 270 

Ser Phe Arg Val Gly Gly Pro Gly Glu Val Ser Val Glu Met Asp Ala 

275 280 285 

Ala Pro Gly Val Asp Leu Thr Arg Leu Leu Asn Asp Met Arg Ala Gin 

290 295 300 

Tyr Glu Thr He Ala Glu Gin Asn Arg Lys Asp Ala Glu Ala Trp Phe 
305 310 315 320 

He Glu Lys Ser Gly Glu Leu Arg Lys Glu He Ser Thr Asn Thr Glu 

325 330 335 

Gin Leu Gin Ser Ser Lys Ser Glu Val Thr Asp Leu Arg Arg Ala Phe 

340 345 350 

Gin Asn Leu Glu He Glu Leu Gin Ser Gin Leu Ala Met Lys Lys Ser 

355 360 365 

Leu Glu Asp Ser Leu Ala Glu Ala Glu Gly Asp Tyr Cys Ala Gin Leu 

370 375 380 

Ser Gin Val Gin Gin Leu He Ser Asn Leu Glu Ala Gin Leu Leu Gin 
385 390 395 400 
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Val Arg Ala Asp 


Ala 


Glu 


Arg Gin 




405 




Asn Val Lys Ala 


Arg 


Leu 


Glu Leu 


420 








Leu Asp Gly Glu 


Ala 


Gin 


Gly Asp 








A A r\ 

440 


Thr Asp Ser Lys 


Ser 


Gin 


Ala Gin 








a a c 


Thr Lys Thr Arg 


Lys 


He 


Lys Thr 


465 




*± / \j 




Glu Val Val Ser 


Ser 


Gin 


Val Gin 




485 






<210> 52 








<211> 361 








<212> PRT 








<213> Homo sapiens 







-65- 



Asn Val Asp His Gin Arg Leu Leu 

410 ~ 415 
Glu He Glu Thr Tyr Arg Arg Leu 
425 430 
Gly Leu Glu Glu Ser Leu Phe Val 
445 

Ser Thr Asp Ser Ser Lys Asp Pro 
460 

Val Val Gin Glu Met Val Asn Gly 
475 480 
Glu He Glu Glu Leu Met 
490 



<400> 52 

Cys Asn Trp Phe Cys Glu Gly Ser Phe Asn Gly Ser Glu Lys Glu Thr 

1 5 10 15 

Met Gin Phe Leu Asn Asp Arg Leu Ala Ser Tyr Leu Glu Lys Val Arg 

20 25 30 

His Val Glu Arg Asp Asn Ala Glu Leu Glu Asn Leu He Arg Glu Arq 

35 40 45 

Ser Gin Gin Gin Glu Pro Leu Leu Cys Pro Ser Tyr Gin Ser Tyr Phe 

50 55 60 

Lys Thr lie Glu Glu Leu Gin Gin Lys He Leu Cys Ser Lys Ser Glu 
65 70 75 & ^ 80 

Asn Ala Arg Leu Val Val Gin He Asp Asn Ala Lys Leu Ala Ala Asp 

85 90 95 

Asp Phe Arg Thr Lys Tyr Gin Thr Glu Gin Ser Leu Arg Gin Leu Val 

100 105 no 

Glu Ser Asp He Asn Ser Leu Arg Arg He Leu Asp Glu Leu Thr Leu 

115 120 125 

Cys Arg Ser Asp Leu Glu Ala Gin Met Glu Ser Leu Lys Glu Glu Leu 

130 135 140 

Leu Ser Leu Lys Gin Asn His Glu Gin Glu Val Asn Thr Leu Arg Cys 
14 5 150 155 160 

Gin Leu Gly Asp Arg Leu Asn Val Glu Val Asp Ala Ala Pro Ala Val 

165 170 175 

Asp Leu Asn Gin Val Leu Asn Glu Thr Arg Asn Gin Tyr Glu Ala Leu 

180 185 190 

Val Glu Thr Asn Arg Arg Glu Val Glu Gin Trp Phe Ala Thr Gin Thr 

195 200 205 

Glu Glu Leu Asn Lys Gin Val Val Ser Ser Ser Glu Gin Leu Gin Ser 

210 215 220 

Tyr Gin Ala Glu He He Glu Leu Arg Arg Thr Val Asn Ala Leu Glu 
225 230 235 240 

He Glu Leu Gin Ala Gin His Asn Leu Arg Tyr Ser Leu Glu Asn Thr 

245 250 255 

Leu Thr Glu Ser Glu Ala Arg Tyr Ser Ser Gin Leu Ser Gin Val Gin 

260 265 270 

Ser Leu He Thr Asn Val Glu Ser Gin Leu Ala Glu He Arg Ser Asp 

275 280 285 

Leu Glu Arg Gin Asn Gin Glu Tyr Gin Val Leu Leu Asp Val Arg Ala 

290 295 300 

Arg Leu Glu Cys Glu He Asn Thr Tyr Arg Ser Leu Leu Glu Ser Glu 
305 310 315 320 

Asp Cys Lys Leu Pro Ser Asn Pro Cys Ala Thr Thr Asn Ala Cys Glu 
325 330 335 
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Lys Pro lie Gly Ser Cys Val Thr Asn Pro Cys Gly Pro Arg Ser Arg 

340 345 350 

Cys Gly Pro Cys Asn Thr Phe Gly Tyr 

355 360 
<210> 53 

<211> 3282 

<212> DNA 

<213> Homo sapiens 



<400> 53 

atgaaggaga 

ccgctggtct 

atcgttcagg 

gccagggtga 

ggaggctggg 

ctcaccatgg 

tacatctgcg 

aaccgccatg 

tgtgaggaag 

ttcagcaaga 

ggaggtggca 

gcctcaccat 

agtcgaaagg 

agcatcctca 

tcctcccacc 

aaaaagtctt 

gtgagcagtt 

cccttccagc 

ctggagcagc 

ccttctccct 

tctggctttg 

gcgccctccc 

acccacaaac 

acaggcctga 

cccaagggca 

gcccagctgg 

agctccggcc 

ttgagcctgg 

tcccacagtg 

gggcccccag 

ccctcctctt 

agcctccctt 

caagctgaga 

acctcccctc 

gaccagacag 

actaacatgg 

gagatgctga 

ctgagcctga 

ttccctgccc 

ctgcctcccc 

agcagcctcg 

ccgagccgca 

cagcagagcc 

cagccgcaga 

gctgaggggc 

ggggggttga 

cccacgctgc 

cagcagcaag 

ccggaacacc 

cagcgtctgc 

tccaccccgc 



tggtaggagg 

actgcgatgg 

tgccaacggg 

ggtgtgagct 

cacacgtggt 

agcccatcgt 

aggagacggg 

gatgtcgaca 

aagtgctgga 

tgaagacatc 

gcatgggggg 

ccacgcagca 

acaaagaacg 

ccccgcccgt 

acgaggccag 

ccagccatag 

ttacctccgc 

ctgcagtctc 

cagaggagga 

cagctcccga 

ggcccatcat 

ctggggacta 

ggatgcccgc 

aggagaagaa 

gccggaacaa 

ctggctttac 

tgggaggtct 

agtccccctt 

gcgggatgct 

ggacctcggc 

ctgcttctat 

ctacccacat 

gcagccacac 

aggagagtct 

cctctgcacc 

agcagcttct 

aggcgctgca 

cggccaaaaa 

tgcctgctgc 

aagccgggag 

gcctggacaa 

gcagctcgtc 

ctgccactct 

acgggttggg 

tgttgggggg 

atggggccgc 

agctgccagg 

agcagcagct 

agactgttgt 

agatggctgg 

tgctgtctgc 



ctgctgcgta 

gcacgcgtgc 

accctggttc 

gtgcccacac 

gtgtgccctc 

gctgcagtac 

ccgggagagc 

agctttccac 

ggtggacaac 

ccggcacagc 

aggtggcagt 

ggagaagcac 

ccttaagcag 

ggtccccact 

cacgcaggag 

cctgagtcat 

ctcctcttct 

gtccctgcag 

caagtactcc 

gccccccaag 

gcgcttctcc 

taagtctccc 

actgagtgcc 

gcacaaagcc 

ggagggcact 

cgccactgct 

gtcctcccga 

actaggggca 

gcgggctgtc 

cctgccccgc 

ctccaccact 

ctttggaacc 

agagccagac 

gtcttccatg 

ctgtgggggc 

ggagaagcag 

cgcgctgcag 

ggagcggctg 

cctgcctgcc 

cagcgactcc 

ctcgctgtcc 

gctgtccttc 

gcccctggcc 

ccgggcaccc 

gctggcaggc 

tgcccccaac 

ctgtctcaac 

ccagcaactc 

ctaccagatg 

gggctcccag 

gggtacccct 



tgttcggacg 

agcgtggccg 

tgccggaaat 

aaagacgggg 

tacatccccg 

gtgcctcatg 

aaggcggcct 

gtcacctgtg 

gtcaagtact 

agcgggggag 

ggtttcatct 

cccacccacc 

aagcacaaga 

gctgacaagg 

acctctgaga 

aaagggaaga 

tcctcctcct 

agctcccctg 

aagcccacag 

gctgaccttt 

accaccacct 

cacgtcacgg 

acccctgtgc 

agcaagagga 

gggggcccag 

gcctcaccct 

acctttgggc 

ggcatctaca 

tgcagcaccc 

ctcagccgct 

caggtgtttt 

cccatgggtg 

ctggaggact 

tcccccatca 

ggccagttag 

ggcgacgggg 

aaggagaacc 

cagattctca 

gccaacggcc 

ttgagcacca 

acttcttctg 

cacagcacgc 

ctgcctgggg 

ggggcagcgg 

agtgggggcc 

cccgcaagct 

agccttacag 

cagcagctcc 

atccagcaga 

ctgcccatgg 

ggcctgctgc 



agaggggctg 

tccaccaagc 

gtgaatctca 

cattgaagag 

aggtgcaatt 

atcgcttcaa 

cgggagcctg 

cccaaatggc 

gcggctactg 

gcggaggagg 

ctgggaggag 

acgagagggg 

agcggcctga 

tctcctcctc 

gcagcaggga 

aactgagcag 

cttcctcctc 

acttctctgc 

cccccgcccc 

ttgagcagaa 

ccagctcagg 

ggtctggggc 

ctgctgatga 

gccgccatgg 

ctgccccatc 

tctctggagg 

cttctgggag 

ccagtaataa 

ctctctcctc 

ccccgttcac 

ctctggctgg 

ccgttaatcc 

gcagcttccg 

gcagcctccc 

acccggcggc 

aggccggcgt 

agcggctgca 

acgtgcagct 

ctgtccctgg 

gcaagagccc 

aggacccaca 

ccccaccgct 

cccctgcccc 

ggctgggggc 

tgcccctcaa 

tgagccaggc 

agcagcagag 

tggcctcccc 

tccagcagaa 

ccagcctgct 

ccacagcgtc 



ggccgagaac 

ttgctatggc 

ggagcgagca 

gactgataat 

tgccaacgtg 

caagacctgt 

catgacctgt 

aggcttgctg 

caaataccac 

cgctggagga 

aagccggtca 

ccagaagaag 

gtcgcccccc 

ggcttcctct 

gtcaaagggg 

tgggaaaggt 

ctctgggggg 

attccccaag 

ttcagcccct 

ggtggtcttc 

ccgggcccgg 

ctcggcaggc 

gacccctgag 

gccaggccgt 

cttgcccagt 

ttccctggtc 

cttgcccagc 

ggaccccatc 

cagcctcctg 

cagcaccctc 

ctctaccttt 

cctcctctcc 

gtgtcggggg 

cgcactcttc 

cccagggacg 

caacatcgtg 

agagcagatc 

ctctgtgccc 

gccctatggc 

tccgggaaag 

ctcaggctgc 

gcccctcctc 

actcccgccc 

catgcccatg 

tgggctcctt 

tggcggggcc 

acatctcctt 

gcagctgacc 

acgggagctg 

ggcaggaagc 

tgctccaccc 



- 60 
120 
180 
240 
300 
360 
420 
48C( 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
180 
186 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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ctgctgcccg ctggagccct agtggctccc tcgcttggca acaacacaag tctcatggcc 3120 

gcagcagctg cagctgcagc agtagcagca gcaggcggac ctccagtcct cactgcccag 3180 

accaacccct tcctcagcct gtcgggagca gagggcagtg gcggtggccc caaaggaggg 324 0 

accgctgaca aaggagcctc agccaaccag gaaaaaggct aa ^ 3282 

<210> 54 

<211> 2227 

<212> DNA 

<213> Homo sapiens 



<400> 54 

gagagcccga 

cagcctctgg 

atgttataat 

cccactcccg 

ccccacctca 

tgcctgcatt 

cccatgtgtg 

acacttcaag 

cggcgggatt 

cgcggcgagg 

tccatcgaat 

ggggatgggg 

gtcatgcatc 

gaggttctgt 

atctacccct 

aagcggctca 

tccgagtgtg 

tcctccctgc 

gccacccacc 

aacgggggta 

actgtcaacg 

cagccaagcc 

tccccagctc 

atgtacatag 

aaatgccaca 

ttttgggggg 

ggccactgga 

tgttgggggc 

gtcagttaga 

ctctgaccag 

ttttacaaat 

gcatcacctc 

tttttttttt 

aggccctgag 

cacccccacc 

tcacctgggg 

actttgccag 

acactcg 

<210> 55 



acaggaagag 
tccccagagc 
ttaccctggg 
accccggaat 
tgtgtgccct 
ccttctgcaa 
acgtgcaggt 
acattgtcta 
tctatgcagc 
tcttggagca 
tctacgaagg 
acaaagagaa 
ttgccaagtt 
acgaggacga 
ggcggcggaa 
ccctagccac 
agtcagtcag 
ccagcccagc 
ctacctcccc 
gcttgaactg 
gcgctcccgt 
tctccactcc 
ttcccacctt 
gaaaaaccaa 
ggaaacaaac 
ggtcttgtgt 
gtctcaataa 
cccttcagac 
attgtttctt 
tggccgcccc 
gcagtttatc 
ctcctcttgc 
ttcctgtgtg 
aggtctcagc 
aaccttaccc 
cctagagtgg 
aggcaaacag 



ggtacagctt 
ggactttgga 
tggaccaggg 
catgcatcgg 
ctgcgggggg 
aacctgcatc 
ccataaaacc 
caaattggtc 
gtaccccctg 
ggagaagggg 
tgccagggac 
aacaggggtg 
tctccgcaac 
gccactgaag 
cgggcctctc 
ggtgcccacc 
cgacaaggct 
caccccatcc 
cactccccct 
cctgcagaca 
gcccccctta 
ttccactttt 
gggggtgggg 
atatacatac 
tttatgtgtg 
aatttgctgt 
agctctgcac 
ccaaagcttt 
tctctccccg 
ccgcgttgtt 
cctggttctg 
agtggaaact 
tgtggccttt 
tcttggagga 
tcaccgggga 
aagtgggggt 
ggaattccgc 



tgtgcaggtc 
agctgaactg 
tcgtacaaaa 
actacacgga 
tacttcatcg 
gtgcgctacc 
cggccgctgc 
cctgggcttt 
acggaggtcc 
gctctgagtg 
cgggatgaga 
cgcttcctgc 
aagatggatg 
gaatactaca 
cccctcaagt 
ccctccgagg 
cccagccctg 
catggctctc 
tcgacagcca 
ccatcctcca 
acttgaggcc 
tctgggccct 
ggcgggtttt 
ttattttcta 
tgtgtatgtg 
ttttgggggt 
catcctcgct 
aggcatgatt 
tttctctccc 
gaatgtccag 
aggagcaagt 
ttgtgcaaag 
gcatcattta 
gggctaaggc 
accctcacta 
gggttaacct 
cgatactgac 



acatgcccac 
cttttgttgc 
gggcaacgct 
tcaaaatcac 
acgccaccac 
tggagaccaa 
tgagcatcag 
ttaaagatga 
ccaacggctc 
atgatgagat 
agaagggccc 
gatgcccagc 
tgcccagcaa 
ccctcatgga 
accgtgtcca 
gcaccaacac 
ccaccctgcc 
ccagttccca 
gtggggccac 
ccagcagggg 
agggaccctc 
tttttccact 
ataaataaat 
tggaccaacc 
tggaaaatgg 
gcctggagat 
gtttcccaag 
ccaactggct 
catcttggct 
aaattgctaa 
gcagggtgga 
aatagatagt 
tcttgtggaa 
tttagcattg 
gcaggactgg 
cacacaagca 
gggctccagg 



tgcagccctc 
tggaagactt 
ccccagtccc 
agagctgaac 
tatcgtggag 
caaatactgc 
gtctgacaaa 
gatgaaacgg 
caatgaggac 
tgtcagcctc 
cctggagaat 
agccatgacc 
gtacaaggtg 
catcgcctac 
gccagcctgc 
cagcggggcg 
agccacctcc 

tgggcctcca 

cacagctgcc 
gcgcaagatg 
tcccttcttc 
tcttctactt 
atatatatat 
agattaattt 
tgttcatttt 
gaactggatg 
gcaggtggtg 
gcatatagga 
gctgtcctgc 
gaacagtgcc 
ggtggcacct 
tctgcctctt 
aagaagattc 
tgaagcgctg 
tggtggagtc 
cagatcccag 
agtcgtcgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
. 840 
V 900 
960 
1020 
.1080 
1140 
.1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
192 0 
1980 
2040 
2100 
2160 
2220 
2227 



<211> 4283 

<212> DNA 

<213> Homo sapiens 



<400> 55 

ttgcgggaaa gagccaaacc ctggcgttgg ggggcccggg cggggagccc ctcccgcggt 60 
ccacagcgac gcctgcccag ccctcctccc cttccggctc cggcacgggg ccccgaggcg 120 
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ttcggaggcc 
gcctccccgg 
cggctgcacc 
ggagggcggg 
cgcgaccatg 
gctggcttgt 
catgtcgtcc 
gaccaagacc 
gccgatcctc 
tcctgttcct 
caatcatctc 
catggtgttc 
agtgacgcgc 
caccacctac 
gcacaacatc 
gccacagttc 
taccaggaac 
tacggttgcc 
caatgacttc 
cctggagaaa 
cctgctggtg 
gcgggcagag 
tggcacacct 
ggaattcgac 
ggaggtgtat 
tgcacatgct 
tgagcagtac 
taccttcagc 
ggtgtatttg 
tgagactgca 
gtgtgcacct 
atttgacacc 
cttggggcta 
ccaggtttgc 
gctctcagtt 
aaccaagtgc 
cttgtctgga 
ttggattcaa 
acaccagaga 
gatggtgagg 
gatgatcacc 
gcctgcctcc 
gtttaaatgg 
gtgccctcga 
cgaactctta 
cgttaagctc 
cagcaaggtg 
tgaggatgtg 
gccttggggc 
aggctttttt 
ccaggccctc 
ctgcattggg 
tgggatgaga 
ccattttgaa 
tccgaagcag 
atggagacgg 
agaggcagat 
aacaactgcc 
aaaacaaatt 
acgtcataat 
cttggaaagc 
agcactgtct 
ttggacttca 
tcaaaccact 
accactacat 
tgggaaaggg 



aggcgggttt 
gacgggccgt 
gcccgggggc 
gcgtggggcg 
gcgcggtgag 
gcgctgtctc 
aactgcacca 
aagaagaagc 
agcgtcctga 
gtcatgctaa 
ttcaataagg 
cgaaaccttc 
agcgccccca 
gaccggcgct 
ttaaagaaat 
ctgggcatgt 
gtgttcagcc 
agagaagcga 
ctcaatgaag 
ctgaagcggg 
ggcatccacg 
gacgaggagt 
ccggacagcc 
ccctctgttg 
ttcatggcca 
gccaaaacgg 
tccaaacgct 
cagagccaga 
ggctagatgg 
gcctgtgaca 
gctcacagga 
caggttaaaa 
gagatgactt 
tatttataat 
cccttcaatt 
tatggatgcc 
ttaactttgt 
gtgaaaactg 
tctcagatca 
ggcaggaaag 
caagccccag 
cactaggtca 
ttatttccct 
tctatttcct 
gctctgggca 
cgggatgacc 
cccccatctt 
atccaggaaa 
tgtttttcct 
ggaaggagtg 
agaactgagc 
ggaggtacat 
tatggtcaga 
tgaatagatt 
gaaccagcac 
cattcttgga 
gggggtcaaa 
gcaagaccac 
tgacttggga 
gaagaatgga 
tgtcttccag 
ctgtgcctga 
ctggggctgc 
gccttggccc 
gacttaggga 
attatgtagg 



ctgtcaggcc 
accacgcgga 
ggggtgcgga 
gacggaacca 
ggagcggggg 
cgccgccccc 
gcaccacggc 
atttcgtgtg 
tgtggggggt 
tgccagatga 
agaacctgcc 
gggagaggtt 
tcaacagtga 
ttgtcatcaa 
accaccagtt 
accgcctgac 
atcggctcac 
gcgacaagga 
ggcagaagct 
acgttgagtt 
acgtggaccg 
gtgagaatga 
ctggcaacct 
acgtctatgc 
tcattgatat 
tgaaacacgg 
tcaacgagtt 
gagctggata 
gagggtggga 
ccgaaagaga 
tgtaacccca 
aggggttccc 
cgtgggttta 
catatttcat 
aaagagatac 
agattggaga 
aatttatgga 
ttgcattatt 
gaatcagaga 
cggctgggct 
gctgtcttag 
agaggaacta 
ttgggaaaac 
tcttccttct 
gatctccaag 
ttgtaggaga 
agagtgtggt 
tccagtttgg 
tgttgccccg 
gcttcctgca 
cacaggctgc 
tcctgcatct 
cttgtagata 
ggtttcctgt 
tgtctctgtg 
cttcactggg 
ccactgcctt 
tacatgactt 
aagggattat 
agtttggatc 
cctccaaacc 
ctcacagcat 
tggattggat 
caggaagggg 
acttgaaacc 
aataatgttt 



cggggaggag 

cggggaggac 

gcgggccggc 

ccggggcggg 

tggggatcgg 

ctctttcgcc 

ggtggcggtg 

ccagaaagtg 

gaaccacacg 

cttcaaagcc 

cagccgcttt 

tggaattgat 

cagccagggt 

gactgtgtcc 

tatagtggag 

cgtggatggt 

tgtgcatcgc 

gaaggccaag 

gcatgtggga 

cttggcacag 

ggcagagcag 

tggggtgggt 

cctcagcttt 

catgaaaagc 

cctcacgcca 

ggcaggggcc 

tatgtccaac- 

tggggtcggg 

gcagagtcgg 

ctttagctga 

ccttctgctt 

tttttggtac 

tttgggtttt 

cagcctaccc 

ccagtagacc 

ggtcagacac 

gtattgtgca 

cctccatcct 

tctcagaggg 

cttggacacc 

cccctggagt 

gagtaccttt 

ttcagaaact 

gacctcctcc 

cgcctggagt 

tctgtctccc 

gtccaaacgt 

aggcttgatg 

ctctagactt 

ggtgttccac 

tctggccagg 

tctcaccccc 

accccaaaga 

ggctccctcc 

cctgactcac 

gctgctggat 

ggccccagga 

agggaacttg 

gtaggaataa 

tgctcctcgt 

tggccaagcc 

ataggtcagg 

gggaaacctt 

ccataggtag 

aactggctca 

ggacttgatt 



gggcgggcgg 
ggggccaggg 
gggctccccg 
gtgggaggta 
tccgggggag 
gccgccgccg 
gcgccgctca 
aagctattcc 
atcaatgagc 
tacagcaaga 
aagtttaagg 
gatcaggatt 
cggtgtggca 
agcgaggacg 
tgtcatggca 
gtggaaacct 
aagtatgacc 
gacttgccaa 
gaggagagta 
ctgaagatca 
gaggagatgg 
ggcaacctac 
cctcggttct 
catgaaagtt 
tacgatacaa 
gagatctcga 
atcctgacgt 
gatcgggagt 
gtttgggagg 
agaggagggg 
acccttgatt 
cttgtaacct 
gtttctgaaa 
accctcccca 
cagcacaagg 
ctcgccctgc 
caacttcctc 
gtctggaata 
gaataagttc 
tggttctcag 
tcagaagtcc 
ggatttatca 
gatgtatcaa 
caggcactct 
gctttttagc 
tgtgcctgga 
gaggtggctt 
tgggttttga 
ttagcagatc 
ctgccttcgg 
agagaaacag 
tcaaccagga 
tgtgaagatc 
aaacctggcc 
agcatatagg 
tggatgggaa 
aggggccata 
aaaccaactg 
tgtttggact 
caggcgcagc 
cagcttccga 
aaagaatgga 
ctggaagagg 
gtctgaacaa 
tggagaaaac 
tccccacgtc 



ggcggccgct 

gactgcaggg* 

gggcggggcg 

acgggacggg 

gcctgaggcc 

ccgccccggg 

gcgccagcaa 

gggccagcga 

tgagcaatgt 

tcaaggtgga 

agtattgccc 

accagaattc 

cgcgtttcct 

tggcggagat 

acacgctttt 

acatggtggt 

tcaagggttc 

cattcaaaga 

aaaagaactt 

tggactacag 

aggtggagga 

tctgctccta 

ttggtcctgg 

cccccaagaa 

agaagaaagc 

ctgtgaaccc 

agttctcttc 

tagggagaag 

gctttagcaa 

gatgtgctgt 

ttttctcccc 

tttaagatac 

tttcattgct 

tctttgctga 

gtccttccag 

tgcatttgct 

cacctttccc 

caccaggtca 

atcctcatgg 

agaaccctgt 

tctctgtaaa 

ggaccctcat 

atgaggccct 

tacttctagc 

agagacacct 

gagttacagc 

cctagttaca 

cctggcctca 

tgcagcccac 

agcctgccac 

ctctgttgtt 

actggggatt 

gcttgtgaaa 

aagcccagct 

tcaggaaaga 

accttctgga 

ggtaggtctg 

gctcatggag 

tgatttcccc 

atctctgaag 

agcaggaacc 

gacggcattc 

cagatggggg 

ctgccgcaag 

aaatttgact 

ataatgaaga 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
144 
150 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
276a 
282( 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
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atggaagttt ggatctgctc ctcgtcaggc gcagcatctc tgaagcttgg aaagctgtct 4140 

tccagcagcc tccgtggcct cgggttccta ccggcttctc tgcatttggt ctgctgatca 4200 

tgttgccata atgtgtatgg aaagtgtaac acattcttac tggttaaaga cgactaccag 4260 

gtatctaact tgtttaacat tga ~ 4283 
<210> 56 

<211> 6140 

<212> DNA 

<213> Homo sapiens 



<400> 56 

gcggccgcag cctgagccag ggccccctcc ctcgtcagga ccggggcagc aagcaggccg 60 

ggggcaggtc cgggcaccca ccatgcgagg cgagctctgg ctcctggtgc tggtgctcag 120 

ggaggctgcc cgggcgctga gcccccagcc cggagcaggt cacgatgagg gcccaggctc 180 

tggatgggct gccaaaggga ccgtgcgggg ctggaaccgg agagcccgag agagccctgg 240 

gcatgtgtca gagccggaca ggacccagct gagccaggac ctgggtgggg gcaccctggc 3 00 

catggacacg ctgccagata acaggaccag ggtggtggag gacaaccaca gctattatgt 360 

gtcccgtctc tatggcccca gcgagcccca cagccgggaa ctgtgggtag atgtggccga 420 

ggccaaccgg agccaagtga agatccacac aatactctcc aacacccacc ggcaggcttc 480 

gagagtggtc ttgtcctttg atttcccttt ctacgggcat cctctgcggc agatcaccat 540 

agcaactgga ggcttcatct tcatggggga cgtgatccat cggatgctca cagctactca 600 

gtatgtggcg cccctgatgg ccaacttcaa ccctggctac tccgacaact ccacagttgt 660 

ttactttgac aatgggacag tctttgtggt tcagtgggac cacgtttatc tccaaggctg 720 

ggaagacaag ggcagtttca ccttccaggc agctctgcac catgacggcc gcattgtctt 780 

tgcctataaa gagatcccta tgtctgtccc ggaaatcagc tcctcccagc atcctgtcaa 840 

aaccggccta tcggatgcct tcatgattct caatccatcc ccggatgtgc cagaatctcg 900 

gcgaaggagc atctttgaat accaccgcat agagctggac cccagcaagg tcaccagcat 960 

gtcggccgtg gagttcaccc cattgccgac ctgcctgcag cataggagct gtgacgcctg 1020 

catgtcctca gacctgacct tcaactgcag ctggtgccat gtcctccaga gatgctccag 1080 

tggctttgac cgctatcgcc aggagtggat ggactatggc tgtgcacagg aggcagaggg 1140 

caggatgtgc gaggacttcc aggatgagga ccacgactca gcctcccctg acacttcctt 1200 

cagcccctat gatggagacc tcaccactac ctcctcctcc ctcttcatcg acagcctcac 1260 

cacagaagat gacaccaagt tgaatcccta tgcaggagga gacggccttc agaacaacct 1320 

gtcccccaag acaaagggca ctcctgtgca cctgggcacc atcgtgggca tcgtgctggc 1380 

agtcctcctc gtggcggcca tcatcctggc tggaatttac atcaatggcc accccacatc 1440 

caatgctgcg ctcttcttca tcgagcgtag acctcaccac tggccagcca tgaagtttcg 1500 

cagccaccct gaccattcca cctatgcgga ggtggagccc tcgggccatg agaaggaggg 1560 

cttcatggag gctgagcagt gctgagaaca ccaagtctcc cctttgaaga ctttgaggcc 1620 

acagaaaaga cagttaaagc aaagaagaga agtgactttt cctggcctct cccagcatgc 1680 

cctgggctga gatgagatgg tggtttatgg ctccagagct gctgttcgct tcgtcagcac 1740 

accccgaata ttgaagaggg ggccaaaaaa caaccacatg gattttttat aggaacaaca 1800 

acctaatctc atcctgtttt gatgcaaggg ttctcttctg tgtcttgtaa ccatgaaaca i860 

gcagaagaac taacataact aactccattt ttgtttaagg ggcctttacc tattcctgca 1920 

cctaggctag gataacttta gagcactgac ataaaacgca aaaacaggaa tcatgccgtt 1980 

tgcaaaacta actctgggat taaaggggaa gcatgtaaac agctaactgt ttttgttaaa 2040 

gatttatagg aatgaggagg tttggctatt gtcacatgac agactgttag ccaaggacaa 2100 

agaagttctg caaacctccc ctggaccctt gctggtgtcc agatgtctgc ggttgtcagc 2160 

cccttccttt cccccgacct aaacataaaa gacaaggcaa agcccgcata attttaagac 222 0 

ggttctttag gacattagtc caccatcttc ttggtttgct ggctctccga aataaagtcc 228 0 

ctttccttgc tccaactcct tgtctctcaa cgtattggct atgacgcagc aagcagaatg 234 0 

aatttggact cagttacagg ctgtcaatgg tctgctctgt agcagtctca gagcctcccc 2400 

gacccactac ctggagatag ccagatagcc agatgccctg ctcctggcca cctttaaagc 2460 

ccctgcatat gacacaggtt aactaaagtc aagattgggg ctgctgcatt ccaggttccc 2520 

tagactcaca agctggtcct tggccaggtg cagtggctca cgcctgtaat cccagcactt 2580 

fc gggaggctg aggcaggcgg atcacctgaa gtcagaagtt tgagaccagc ctggccaaca 2640 

taattaaaat gtctctacta aaaatacaaa aaattagctg ggtgtggtga cgcttgcctg 2700 

tatcccagct actcaggaag ctgagacacg agaatcactt gaacctggga ggcagaggtt 2760 

gcagtgagct cagatagtgc cactgcactc cagcctgggt gacagagcga gactccgtct 2820 

caaaaaaaaa aaaagaaagc agaacctcat ggctatagag ttggcatttt agccccagct 2880 

tctgtagctc tgaaagccta aagaaggtat tctctccatc tgttaaacac agtatagtgg 294 0 

ctctcagccc ttggggcatg ttatcatggg agggaagtca aataagagga gagaaaagaa 3000 

ctcaaggggg aaactgcatt tttaggcttt gctctcttac cttgcccttt ctactcagaa 3060 
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ccaataactt 

agcccagcct 

tattctgatg 

gtacatacct 

ttataggcat 

aactgcaggt 

gcatgggagg 

aaagacacca 

cctgtggggt 

gaagtaccta 

ccatgtctcc 

aactggatct 

cagcactttg 

gggcaacaac 

gaggctgagc 

atcatgccac 

aaaaaaaaga 

ctctatgaga 

aaatccatag 

catctgccgc 

tgctttgact 

gcttcaccgc 

aatggttggc 

gtgctgtcca 

caggtggtaa 

cccactgtgt 

ttcctcaaat 

tgcctggttt 

cttcacaaag 

ctggttattg 

aatgaccctc 

atttgctgcc 

ccccctgcaa 

agctggggag 

atggccaggt 

gattgcttga 

ttaaaaataa 

tgtatgtttg 

cacacacagg 

agcttggttt 

tcaaaaagca 

ggctctctct 

ggctgcactc 

tgatgcggcc 

ctggggactg 

tctctgcatg 

tgtgcaaaag 

ggttctgatg 

cagctatctg 

gcctaaaaag 

ccatgcattt 

aaaaaaaaaa 

<210> 57 



ctgcatcaaa 
tccctgggtg 
gcctgttcgg 
ccaaaggcag 
taaagacaag 
ggattctgtt 
gacttttttt 
ggttggagag 

ggggaggcgt 

cctccttgcc 
atctggtcat 
gtccaataaa 
ggaggctgag 
aacaaaaatg 
caggaggatc 
tgtactccag 
acggaaaaag 
tgcttggttc 
gcctctcttg 
caaatacatc 
gaggccgagg 
atcaccaccc 
cgagctccca 
tggaatcagg 
atgctctgct 
cctttagaag 
tcagaccctc 
atcaagcaga 
gataaaccac 
gtccaccccc 
atgacatgta 
attttttact 
accagctaaa 
ttggtggcaa 
gcagtggctc 
gctcaggagt 
atttttaaaa 
tgtgtgtgtt 
tattttgttg 
gttaaattga 
aagtgttcac 
cctctagttc 
agactccctg 
aatctctgtc 
agggcaccca 
gattctcagt 
aaagtgcaaa 
gctgctgaac 
ctcatcagag 
aaaccccagc 
ttatctactg 
aaaaaaaaaa 



acatgttaca 

agcttgctat 

tgccgggttt 

gacttgcctg 

tcgggtggct 

gagaggcaaa 

gagacagggt 

gcacctgcca 

aggggtctac 

ttgctggact 

actccatagc 

cgcttatttg 

gtgggagggt 

ccaggtgtgg 

acttgagccc 

cctgggtgac 

aaatgcttac 

tatttttttg 

gtgacacatc 

cactccccat 

catctggaac 

cttcctccac 

gtgggaaaac 

ccacagcagg 

ggtgggctgt 

cttgaaagga 

ctfcccaccgg 

gctgaggccc 

ggagcaactg 

gaccagattc 

taaatgggta 

ctggggagaa 

tttgataatt 

ctttccaagt 

acgcctgtag 

tcaaggctgc 

tttgtgtgtt 

tgtatacagc 

ctaaagagat 

gagataaaca 

cactaaagct 

ctggaacaga 

gccaagcaaa 

tccaaaaacc 

gccttctaaa 

gttaataata 

ggatttggag 

tccgacttct 

gtgctttcct 

tgtttactta 

tctgaggaca 



gcctgcatca 

gcgcagccac 

actcactgcc 

ataaatattt 

agagggctcc 

gcacctgagt 

ctcactgtgt 

actacttgct 

ggctgcctga 

gtcagccagt 

taccaagtta 

gccaagcctg 

tgcttgagcc 

tggggtgcac 

gggaggttga 

agagtgagac 

attgtcaggg 

ggagactttg 

tctagtactt 

gccactgaca 

tttctctgcc 

tcctgacatt 

gtgtgctaga 

aactgcccca 

gcttccctca 

acccaactct 

caccccccta 

cacgtttcca 

gaaaaccatc 

ccttacactt 

tgactaagaa 

gcagccccaa 

taaaccccct 

caaggtcttg 

tcccaggtac 

agagagctat 

ttatcagggg 

cttgtttaat 

tggacaaggt 

atgacaagag 

aaccattcat 

ctcacagatt 

cctacaccag 

tacctgaact 

acacaatgta 

aaaggaagca 

tggcattccg 

tcgctgagac 

caacctcctg 

tattggtctg 

tacaataaat 



agggctttac 

atttaccatg 

ctgttctgat 

ttcctcctct 

ttgagacata 

ggttgggaca 

cgcccagggc 

ttccctggag 

gatgggtgtg 

cgcagggccg 

acctgctcta 

atggctcgtg 

caggggtttg 

ctgtagtccc 

ggctgcagtg 

cctgtctccg 

atcctgtaga 

tccaagtgtt 

tttgtcataa 

tcctatgggt 

tgcaggggct 

ctttcccttc 

gttggggagt 

ctggccattt 

ttcagagagc 

ttgctgcact 

ctccaccctc 

actctgattg 

agccaagcgt 

aactcacttc 

gaggctgtga 

ctcatcactg 

gcccctaaaa 

cttagaaagt 

ttgggagcct 

gatcatccca 

tctcctgtac 

gttttgagca 

tgtagctgtg 

ctgccagcca 

ctggttgcag 

ggcatgaagc 

aagagtcagt 

taatggtaga 

ttcatgtgtt 

ttcttttaca 

aagatcacca 

atgactgtgg 

caccacctcc 

taaatccctg 

ctgagaaagt 



cccaacctgc 
tggggctccc 
gtcagtgcct 
gaactggatt 
cctagcaggg 
caggcagctg 
aaggatgccc 
cctgcatgtg 
cacagtgtgt 
gccacaagac 
aactttggag 
cctgtactcc 
agaccagctt 
agctactagg 

gggggtcata 

aaaaaaaaaa 

caatcattaa 

ttggcttaag 

acaaacaggc 

cagccaggct 

agcagcagag 

agggatccaa 

gagatgagtg 

gagacacaca 

tctgttacag 

gtcctttttc 

agctcttcct 

tcacttgcat 

tcggatgagt 

tttctttggc 

tctaacattt 

ggaaagaact 

cttctcacag 

ccttcactac 

gaggcaggag 

ctgcatttgt 

agtgtatctg 

ataagatatg 

ctcaggcttc 

accacactat 

gcaaggctaa 

actgatcagg 

gtcacagata 

attcaaagat 

tagtgtaaac 

actcctgctg 

cacatacctt 

gaacagcctc 

aagagaaaca 

gaagtaaacc 

ctatgctgtc 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380^ 

444(19 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

57 6 A 

5820 

5880 

5940 

6000 

6060 

6120 

6140 



<211> 2098 



<212> DNA 

<213> Homo sapiens 



<400> 57 

gcaggagcac gtggagaggc cgggtagcca cagcggcagc tccagcccgg cccggcagcg 60 

acatggaaga tatacaaaca aatgcggaac tgaaaagcac tcaggagcag tctgtgcccg 120 

cagaaagtgc agcggttttg aatgactaca gtttaaccaa atctcatgaa atggaaaatg 180 
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tggacagtgg agaaggccca gccaatgaag atgaagacat aggagatgat tcaatgaaag 240 

tgaaagatga atacagtgaa agagatgaga atgttttaaa gtcagaaccc atgggaaatg 300 

cagaagagcc tgaaatccct tacagctatt caagagaata taatgaatat gaaaacatta 360 

agttggagag acatgttgtc tcattcgata gtagcaggcc aaccagtgga aagatgaact 420 

gcgatgtgtg tggattatcc tgcatcagct tcaatgtctt aatggttcat aagcgaagcc 480 

atactggtga acgcccattc cagtgtaatc agtgtggggc atcttttact cagaaaggta 540 

acctcctccg ccacattaaa ctgcacacag gggaaaaacc ttttaagtgt cacctctgca 600 

actatgcatg ccaaagaaga gatgcgctca cggggcatct taggacacat tctgtggaga 660 

aaccctacaa atgtgagttt tgtggaagga gttacaagca gagaagttcc cttgaggagc 720 

acaaggagcg ctgccgtaca tttcttcaga gcactgaccc aggggacact gcaagtgcgg 780 

aggcaagaca catcaaagca gagatgggaa gtgaaagagc tctcgtactg gacagattag 840 

caagcaatgt ggcaaaacga aaaagctcaa tgcctcagaa attcattggt gagaagcgcc 900 

actgctttga tgtcaactat aattcaagtt acatgtatga gaaagagagt gagctcatac 960 

agacccgcat gatggaccaa gccatcaata acgccatcag ctatcttggc gccgaagccc 1020 

tgtgcccctt ggtccagaca ccgcctgctc ccacctcgga gatggttcca gttatcagca 1080 

gcatgtatcc catagccctc acccgggctg agatgtcaaa cggtgcccct caagagctgg 1140 

aaaggaaaag catcctcctt ccagagaaga gcgtgccttc tgagagaggc ctctctccca 1200 

acaatagtgg ccacgactcc acggacactg acagcaacca tgaagaacgc cagaatcaca 1260 

tctatcagca aaatcacatg gtcctgtctc gggcccgcaa tgggatgcca cttctgaagg 1320 

aggttccccg ctcttacgaa ctcctcaagc ccccgcccat ctgcccaaga gactctgtca 1380 

aagtgatcga caaggaaggg gaggtgatgg atgtgtatcg gtgtgaccac tgccgcgtcc 1440 

tcttcctgga ctatgtgatg ttcacgattc acatgggctg ccacggcttc cgtgaccctt 1500 

tcgagtgtaa catgtgtgga gatcgaagcc atgatcggta tgaattctcg tctcacatag 1560 

ccagaggaga acacagaagc ctgctgaagt gaatatctgg tctcagggat tgctcctatg 1620 

tattcagcat cgtttctaaa aacagttgac ctcgcctaac agattgctct caaaacatac 1680 

tcagttccaa acttcttttc ataccatttt tagctgtgtt cacaggggta gccagagaaa 1740 

cactgtcttc cttcagaaat tattcgcagg tctagcatat tattactttt gtgaaacctt 1800 

tgttttccca tcagggactt gaattttatg gaatttaaaa gccaaaaagg tatttggtca 1860 

ttatcttcta cagcagtgga atgagtggtc ccggagatgt gctatatgaa acattctttc 192 0 

tgagatatat caaccacacg tggaaaagcc tttcagtcat acatgcaaat ccacaaagag 1980 

gaagagctga ccagctgacc ttgctgggaa gcctcaccct tctgcccttc acaggctgaa 2040 

gggttaagat ctaatctccc taatctaaat gacagtctaa gagtaagtaa aaaaacaa 2098 
<210> 58 ^ 

<211> 2947 

<212> DNA 

<213> Homo sapiens 



<400> 58 

atgccaattc ctcctccccc gccaccccca cctggtcctc ctccacctcc cacatttcat 60 

caggcaaaca cagagcagcc caagctgagt agagatgagc agcggggtcg aggcgccctc 120 

ttacaggaca tttgcaaagg gaccaagctg aagaaggtga ccaacattaa tgatcggagt 180 

gctcccatcc tcgagaagcc gaaaggaagc agtggtggct atggctctgg aggagctgcc 240 

ctgcagccca agggaggtct cttccaagga ggagtgctga agcttcgacc tgtgggagcc 300 

aaggatggtt cagagaacct agctggtaag ccagccctgc aaatccccag ttctcgagct 360 

gctgccccaa ggcctccagt atctgccgcc agcgggcgtc ctcaggatga tacagacagc 42 0 

agccgggcct cactcccaga actgccccgg atgcagagac cctctttacc ggacctctct 480 

cggcctaata ccaccagcag tacgggcatg aagcacagct cctctgcccc tcccccacca 540 

cccccagggc ggcgtgccaa cgcacccccc acacctctgc ctatgcacag cagcaaagcc 600 

cccgcctaca acagagagaa acccttgcca ccgacgcctg gacaaaggct tcaccctggt 660 

cgagagggac ctcctgctcc acccccagtc aaaccacctc cttcccctgt gaatatcaga 72 0 

acaggaccaa gtggccagtc tctggctcct cctcctccgc cttaccgcca gcctcctggg 780 

gtccccaatg gaccctctag ccccactaat gagtcagccc ctgagctgcc acagagacac 840 

aattctttgc ataggaagac accagggcct gtcagaggcc tagcacctcc tccacccacc 900 

tcggcctccc catctttact gagtaatagg ccacctcccc cagcccgaga ccctcccagt 960 

cggggagcag ctcctccacc cccaccacct gtgatccgaa atggtgccag ggatgctccc 1020 

cctcccccac caccataccg aatgcatggg tcagaacccc cgagccgagg aaagccccca 1080 

cctccaccct caaggacgcc agctgggcca ccccctcctc ctccaccgcc cctgaggaat 1140 

ggccacagag attctatcac cactgtccgg tctttcttgg atgattttga gtcaaagtat 1200 

tccttccatc cagtagaaga ctttcctgct ccagaagaat ataaacactt tcagaggata 1260 

tatcccagca aaacaaaccg agctgcccgt ggagccccac ctctgccacc cattctcagg 1320 

tgaagcctgg cttggtcccg ttcctcagga aaaggatgga ccttctcttc ttctcagatg 1380 
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gtcccttcca ttcccctgaa acctgcatga gagctcctaa catgtttctc caatgcaatc 1440 

aagccctaga ctccaaatgt cctcccagct cacctccatc tatgcatctc atctctggat 1500 

ttggtgatca gactctatat tgacagtagg atctcaaacc ctgcatccat ccttcctcca 1560 

gcaagccctg ctagccacat gaggaacaag tttccgtgtc ttctgccttc ctcttgggga 1620 

aaggtgcctt gttgtgatga attaactcac tgttagggca gggtggagaa tggtactcct 1680 

tccttctcct gtccactgtg ggggaagctt ggcaggtata ttatatttca tcatttagga 1740 

ggctggcatg accaggactt atgggtggga ggggagcatt tttagtgaag caagaaagga 1800 

gtttgccaag aagtgatctg ttttaaaggt catatttgga gaaagggcaa ggaattgggt 1860 

ctgctttatt tttgggggta ttttgttttt gttctcacct gctgcccccc caccccacca 1920 

ccccagggat aaattggata taaacactaa atactaatca gttgaactta acatttaata 1980 

aaaagaaagg gtgaaataaa ctgaagacca ttttagaact agtcagttct ctgcagcaaa 2040 

gggaacagga gccatttgaa ccctctggga cccctcaccc cactgcttca gggtgctagg 2100 

ctgagggatg tttttcctcc cccttaccgc ccatgccctt gaaagaaaag tcactttttg 2160 

tggagggcat cattcattcc tgattcacaa accccaaaaa cctctggtgg gagataggaa 2220 

gatagggcgt gggcctgggc cttaacctca atcttgtgtc tgcctcagtc ttttctgact 2280 

ggccctgaag ttgtcagtgg ctctttctgt ccttcagccc ctggaaggtg ctccaggata 2340 

acaaagaagg gcaggttgaa gcccctcatg gaaggagctg gctttgtggg gctgcaaagg 2400 

acttttaagt cctgcctgta ctgaagttca cagcccacct gactgagcag actcttcctg 2460 

ttcctttctc taccaccctt gccttcccag gactgcacgg tttaacacag cagagtacag 2520 

aagggtgaag aagtgagcag aggcttatga agatattcag atactcttct atgccaggaa 2580 

gcacaaagac tttgttgaga tttgcctcag ttcagtagat cttccttggc agccagccat 2640 

aggttgtttc tttgtcttcc gggtcctaaa gagcacagag aaaatggagg tccccagtct 2700 

aggtaggaag ctgattggat gaggacttct ttttttccga cagcaggatg gggctcttgg 276(4 

gctccacaca ccagatgctt tggttttcta caactgttgc tatgtgtaga gggtgctcag 2820 

agcgtggcat gagagcaagg agaccatggc tactctttga aatggatggg gaaaattagc 28 80 

ttaaaaattt aatcacgaga ttgcgccact gcactccagc ctgggcgaca gagccagact 294 0 

ccgtctc 2947 
<210> 59 

<211> 784 

<212> DNA 

<213> Homo sapiens 



<400> 59 

gagcggttgc 

tcctagtaca 

aaggggaaga 

gtgaccacgg 

gggctcgcca 

gagttgaagg 

ttgtatgaga 

aagaccttta 

gactttgtgg 

qaqcccaaca 

gtggaccggg 

atcaccacca 

tttctttttt 

aaaa 

<210> 60 



gcagtgaagg 
ccgcaatcat 
actgtgtggc 
acttccagaa 
ctgacgtcca 
aaggtcggca 
aacggtttgg 
agcccttcat 
tcagtggcac 
tggatccgga 
atgcagtgtc 
ggacactgaa 
tgaaataaaa 



ctagacccgg 
gtctattatg 
catcgctgca 
gatctttccc 
gacagttgcc 
gatcaaacct 
cccttactac 
ttgctctcta 
ctgcgccgaa 
tcacctgttt 
aggcatggga 
ggcccgaatg 
tagcctgtct 



tttactggaa 
tcctataacg 
gacaggcgct 
atgggtgacc 
cagcgcctca 
tataccctca 
actgagccag 
gacctcatcg 
caaatgtacg 
gaaaccatct 
gtcattgtcc 
gactaaccct 
ttcaaaaaaa 



ttgctctggc 
gaggggccgt 
tcgggatcca 
ggctgtacat 
agttccggct 
tgagcatggt 
tcattgccgg 
gctgccccat 
gaatgtgtga 
cccaagccat 
acatcatcga 
gttcccagag 
aaaaaaaaaa 



gatcgagggg 
catggccatg 
ggcccagatg 
cggtctggcc 
gaacctgtat 
ggccaacctc 
gttggacccg 
ggtgactgat 
gtccctctgg 
gctgaatgct 
gaaggacaaa 
cccacttttt 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
42 
48 
540 
600 
660 
720 
780 
784 



u 

P 



<211> 3033 

<212> DNA 

<213> Homo sapiens 



<400> 60 

atactcctaa gctcctcccc cggcggcgag ccagggagaa aggatggccg gcctggcggc 60 

gcggttggtc ctgctagctg gggcagcggc gctggcgagc ggctcccagg gcgaccgtga 12 0 

gccggtgtac cgcgactgcg tactgcagtg cgaagagcag aactgctctg ggggcgctct 18 0 
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73 



gaatcacttc 

cgactgtaag 

agtgcctcag 

atcggccgtg 

caccttcgtg 

cctcaatgca 

aatggactac 

gtgagcctgc 

ccagccttcc 

aacacatcag 

aaggagctgt 

ttcaggggcc 

a tgggccacc 

cccagctgtg 

ctacctgagc 

cctggtcaac 

cgtgcgcaag 

tgacttccca 

ccctgtccac 

atcagaggac 

cctgcccccg 

caacttcttg 

ctgttggccc 

cgagagtggg 

tagctctacc 

agctgcctgg 

tagggataca 

gtggtgctgg 

cttgagggac 

ggttcccagg 

aaggatcctg 

ggctgagtgc 

taggtgcgca 

gtgcaggggg 

atgtgagatg 

ggaatcctgt 

gggcggacag 

gcccctttgc 

gcagcccctc 

tttctccttg 

gcccctttgc 

gaaagcttat 

gccccagggc 

ttaccagcag 

ctttccatcc 

accccttgtc 

gtggcctgcg 

tgtgcttaat 

<210> 61 



cgctcccgcc 
tatgagtgta 
ttccatggca 
gcctcgtttc 
ccagcctcct 
tggttctggt 
ttctgtgcct 
ctgggtggct 
cctgggggcc 
ctgtcctggg 
agcacttgct 
atgcaccaat 
tgccctctga 
gtcagtgcct 
ctcatccgct 
gtggtgtggt 
tgcgtggtgg 
ccgctcttct 
gtcctctttt 
aagttcaagc 
ccctgctggc 
aacttggaca 
tcaccagcct 
cagcccctct 
9ggaggacag 
ctgggtcctg 
gggggtcccc 
atgcttgttc 
caagctgctg 
aggcctcacc 
tgctgctgtc 
ataggtgaca 
gtgtggagac 
tgggtgtgtt 
agtgactgcc 
caccatcaat 
ccaggagctc 
ccgcctcctg 
ggaggcagag 
ccttcgcatg 
catagcctga 
ggctgggtct 
aggcaggggc 
ccctggcatg 
caaaggcagt 
cctctttgag 
ctagcttctt 
aaatcaattc 



agccaatcta 
tgtgggtcac 
a gtggccctt 
tcaatggcct 
cccccatgta 
ccacagtctt 
ccactgtcat 
gcaggggcaa 
ttcttcacta 
tgaggactct 
gcccagctgt 
9ttgggggga 
gcaaccctgg 
tccgggctct 
tcgactatgg 
ggctggcctg 
tggtcttgct 
gggtcctgga 
tcagctttct 
tggactgaag 
ctcccttctc 
tgaaggatgt 
tggagtctgt 
acctcctgga 
ctgcctgttt 
aagccctctg 
ttctgttacc 
tttggccagc 
ggattgggaa 
atactccctt 
tggttgagag 
gggccgtgag 

gggtgttgtc 

agcgtgggtt 
ggtgaatgtg 
aatcacttgt 
tccatggcca 
caaacctcac 
gaaggaaaat 
gctggccttc 
ttttggggag 
ggtttcttcc 
cacactatgc 
ttcctgcccc 
ctccgtggtt 
ggaggggagc 
ttgatactga 
caagcctcac 



catgagtcta 

cgttgggctc 

ctcccggttc 

ggccagcctg 

ccacacctgt 

ccacaccagg 

cctacactca 

aatcgaaccc 

gtctcccaac 

ggggtaggac 

ggcctgtttg 

ggagatgctt 

acggtggggc 

cctgctgctc 

ctacaacctg 

gtgcctgtgg 

gctgcagggg 

tgcccatgcc 

ggaagatgac 

accttggagc 

ccctcaaccc 

gggcccagaa 

tctagggaag 

gctgaactgg 

cctccccacc 

tctacctggg 

accccccacc 

caaggttcac 

ggagtttcac 

tcagggccag 

cctgccaccg 

catgggcctg 

ggggaagagg 

aggggaacgt 

tccacagttg 

ggagcgccag 

ggctgcctgt 

agggtcccca 

ggggatggct 

ccctccaaaa 

gaggaagggg 

cttcccagag 

ctgcgccctg 

acaggaatag 

gaagcagact 

tatgctagga 

aaacttttaa 

ctg 



gcaggctgga 

tacctccagg 

ctgttctttc 

gtgatgctct 

gtggccttcg 

gacactgacc 

atctacctgt 

tgggggcaga 

acctacgccc 

tgggggccct 

gtggggagag 

cagggaatgc 

aggaccgtgg 

atgctgaccg 

gtggccaacg 

aaccagcggc 

ctgtccctgc 

atctggcaca 

agcctgtacc 

gagtctgccc 

ttgagatgat 

tcatgtggcc 

gcctcccagc 

ggtggaactg 

agcctcctcc 

agaccaggga 

ctcctccagg 

ggcgattctc 

cctgaccgtt 

ggctccagca 

tgtgtcggga 

ggtgtgtgtg 

tgtggcttca 

gtgtgcgcgt 

agaggttgga 

ctctgcccaa 

gtgcatgttc 

cacaacagtg 

ggggctctct 

cctccattcc 

cgatttgagg 

ggtcttactg 

gtaaaggtga 

aatggaggga 

ggatttttgc 

ctccaacctc 

ggtgggaggg 



cctgtcggga 
aaggtcacaa 
aagagccggc 
gccgctaccg 
cctgggtgtc 
tcacagagaa 
gctgcgtcag 

aa ggggtcac 

cccaaccccc 
ggctcctgac 
gggtagtgac 
tgctctgggg 
ggctgcagca 
tgcacgtctc 
tggctattgg 
ggctgcctca 
tcgagctgct 
tcagcaccat 
tgctgaagga 
cagtggggat 
tttctctttt 
agcccacccc 
atctgggact 
agtgtgttct 
ccacatcccc 
ccacaggcct 
acaccactag 
cccatgggat 
gccctagcca 
agcccagggc 
gtgtgggcca 
agctcaggcc 
aagtgtgtgt 
gctggtgggc 
gcaggatgag 
gacgccacct 
cctgtctggt 
ccctccagaa 
ccatcctcct 
cctgctgcca 
gagaagggga 
ttccagggtg 
cccctgccat 
gctccagaaa 
tctgcccctg 

a ggg a <=tcgg 

tggcaaggga 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
.2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3033 



<211> 1174 

<212> DNA 

<213> Homo sapiens 



<400> 61 

aagctcctcc 

tcctgctagc 

accgcgactg 

tccgctcccg 

agtatgagtg 

agttccatgg 

tggcctcgtt 



cccggcggcg 
tggggcagcg 
cgtactgcag 
ccagccaatc 
tatgtgggtc 
caagtggccc 
tctcaatggc 



agccagggag 
gcgctggcga 
tgcgaagagc 
tacatgagtc 
accgttgggc 
ttctcccggt 
ctggccagcc 



aaaggatggc 
gcggctccca 
a gaactgctc 
tagcaggctg 
tctacctcca 
tcctgttctt 
tggtgatgct 



cggcctggcg 

gggcgaccgt 
tgggggcgct 
gacctgtcgg 
ggaaggtcac 
tcaagagccg 
ctgccgctac 



gcgcggttgg 
gagccggtgt 
ctgaatcact 
gacgactgta 
aaagtgcctc 
gcatcggccg 
cgcaccttcg 



60 
120 
180 
240 
300 
360 
420 
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tgccagcctc ctcccccatg taccacacct gtgtggcctt cgcctgggtg tccctcaatg 480 

catggttctg gtccacagtc ttccacacca gggacactga cctcacagag aaaatggact 540 

acttctgtgc ctccactgtc atcctacact caatctacct gtgctgcgtc aggaccgtgg 600 

ggctgcagca cccagctgtg gtcagtgcct tccgggctct cctgctgctc atgctgaccg 660 

tgcacgtctc ctacctgagc ctcatccgct tcgactatgg ctacaacctg gtggccaacg 720 

tggctattgg cctggtcaac gtggtgtggt ggctggcctg gtgcctgtgg aaccagcggc 780 

ggctgcctca cgtgcgcaag tgcgtggtgg tggtcttgct gctgcagggg ctgtccctgc 840 

tcgagctgct tgacttccca ccgctcttct gggtcctgga tgcccatgcc atctggcaca 900 

tcagcaccat ccctgtccac gtcctctttt tcagctttct ggaagatgac agcctgtacc 960 

tgctgaagga atcagaggac aagttcaagc tggttgaagc agactggatt tttgctctgc 1020 

ccctgacccc ttgtccctct ttgagggagg ggagctatgc taggactcca acctcaggga 1080 

ctcgggtggc ctgcgctagc ttcttttgat actgaaaact tttaaggtgg gagggtggca 1140 

agggatgtgc ttaataaatc aattccaagc ctca 1174 
<210> 62 

<211> 3167 ...... 

<212> DNA 

<213> Homo sapiens 




<400> 62 

aagctcctcc cccggcggcg agccagggag aaaggatggc cggcctggcg gcgcggttgg 60 

tcctgctagc tggggcagcg gcgctggcga gcggctccca gggcgaccgt gagccggtgt 120 

accgcgactg cgtactgcag tgcgaagagc agaactgctc tgggggcgct ctgaatcact .180 

tccgctcccg ccagccaatc tacatgagtc tagcaggctg gacctgtcgg gacgactgta 240 

agtatgagtg tatgtgggtc accgttgggc tctacctcca ggaaggtcac aaagtgcctc 300 

agttccatgg caagtggccc ttctcccggt tcctgttctt tcaagagccg gcatcggccg 360 

tggcctcgtt tctcaatggc ctggccagcc tggtgatgct ctgccgctac cgcaccttcg 420 

tgccagcctc ctcccccatg taccacacct gtgtggcctt cgcctggatg agaaaactga 480 

ggcacagcaa ggctaaataa cttgcccaag gacacacagg aaatgcagag ccaggaactg 54 0 

aaccctggca gtctggctgt agggcttgca ttcttaatga taccactacc tcccaaatct 600 

gaggaaaggg tgtccctcaa tgcatggttc tggtccacag tcttccacac cagggacact 660 

gacctcacag agaaaatgga ctacttctgt gcctccactg tcatcctaca ctcaatctac 720 

ctgtgctgcg tcaggtgagc ctgcctgggt ggctgcaggg gcaaaatcga accctggggg 780 

cagaaagggg tcacccagcc ttcccctggg ggccttcttc actagtctcc caacacctac 840 

gccccccaac ccccaacaca tcagctgtcc tgggtgagga ctctggggta ggactggggg 900 

ccctggctcc tgacaaggag ctgtagcact tgctgcccag ctgtggcctg tttggtgggg 960 

agaggggtag tgacttcagg ggccatgcac caatgttggg gggaggagat gcttcaggga 1020 

atgctgctct ggggatgggc cacctgccct ctgagcaacc ctggacggtg gggcaggacc 1080 

gtggggctgc agcacccagc tgtggtcagt gccttccggg ctctcctgct gctcatgctg 1140 

accgtgcacg tctcctacct gagcctcatc cgcttcgact atggctacaa cctggtggcc 1200 

aacgtggcta ttggcctggt caacgtggtg tggtggctgg cctggtgcct gtggaaccag 126^ 

cggcggctgc ctcacgtgcg caagtgcgtg gtggtggtct tgctgctgca ggggctgtcc 1320 

ctgctcgagc tgcttgactt cccaccgctc ttctgggtcc tggatgccca tgccatctgg 1380 

cacatcagca ccatccctgt ccacgtcctc tttttcagct ttctggaaga tgacagcctg 1440 

tacctgctga aggaatcaga ggacaagttc aagctggact gaagaccttg gagcgagtct 1500 

gccccagtgg ggatcctgcc cccgccctgc tggcctccct tctcccctca acccttgaga 1560 

tgattttctc ttttcaactt cttgaacttg gacatgaagg atgtgggccc agaatcatgt 162 0 

ggccagccca ccccctgttg gccctcacca gccttggagt ctgttctagg gaaggcctcc 1680 

cagcatctgg gactcgagag tgggcagccc ctctacctcc tggagctgaa ctggggtgga 174 0 

actgagtgtg ttcttagctc taccgggagg acagctgcct gtttcctccc caccagcctc 1800 

ctccccacat ccccagctgc ctggctgggt cctgaagccc tctgtctacc tgggagacca 1860 

gggaccacag gccttaggga tacagggggt ccccttctgt taccaccccc caccctcctc 1920 

caggacacca ctaggtggtg ctggatgctt gttctttggc cagccaaggt tcacggcgat 1980 

tctccccatg ggatcttgag ggaccaagct gctgggattg ggaaggagtt tcaccctgac 2040 

cgttgcccta gccaggttcc caggaggcct caccatactc cctttcaggg ccagggctcc 2100 

agcaagccca gggcaaggat cctgtgctgc tgtctggttg agagcctgcc accgtgtgtc 2160 

gggagtgtgg gccaggctga gtgcataggt gacagggccg tgagcatggg cctgggtgtg 2220 

tgtgagctca ggcctaggtg cgcagtgtgg agacgggtgt tgtcggggaa gaggtgtggc 2280 

ttcaaagtgt gtgtgtgcag ggggtgggtg tgttagcgtg ggttagggga acgtgtgtgc 2340 

gcgtgctggt gggcatgtga gatgagtgac tgccggtgaa tgtgtccaca gttgagaggt 24 00 

tggagcagga tgagggaatc ctgtcaccat caataatcac ttgtggagcg ccagctctgc 2460 

ccaagacgcc acctgggcgg acagccagga gctctccatg gccaggctgc ctgtgtgcat 2520 
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tggtgcccct 

agaagcagcc 
tccttttctc 
gccagcccct 
gggagaaagc 
ggtggcccca 
ccatttacca 
gaaactttcc 
cctgacccct 
tcgggtggcc 
gggatgtgct 



gttccctgtc 
agtgccctcc 
ctctccatcc 
ttcccctgct 
gagggagaag 
actgttccag 
gtgacccctg 
9ggagctcca 
ttgctctgcc 
cctcagggac 
agggtggcaa 
<210> 63 

<211> 2733 

<212> DNA 



<213> Homo sapiens 
<220> 

<22 1 > misc_f eature 

<222> (2694) . . (2694) 

<223> n=a, c, g or t 

<220> 

< 2 2 1 > mis cofeature 

<222> (2724) . . (2724) 

<223> n=a, c, g or t 



ttgcccgcct 
cctcggaggc 
cttgccttcg 
ttgccatagc 
ttatggctgg 
gggcaggcag 
gcagccctgg 
atcccaaagg 
tgtccctctt 
tgcgctagct 
taataaatca 



-75- 

cctgcaaacc 
agaggaagga 
catggctggc 
ctgattttgg 
gtctggtttc 
gggccacact 
catgttcctg 
cagtctccgt 
tgagggaggg 
tcttttgata 
attccaagcc 



tcacagggtc 
aaatggggat 
cttcccctcc 
ggaggaggaa 
ttcccttccc 
atgcctgcgc 
ccccacagga 
ggttgaagca 
gagctatgct 
ctgaaaactt 
tcacctg 



cccacacaac 
ggctggggct 
aaaacctcca 
ggggcgattt 
agagggtctt 
cctggtaaag 
atagaatgga 
gactggattt 
aggactccaa 
ttaaggtggg 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3167 



<400> 63 

agggagaaag 

tggcgagcgg 

aagagcagaa 

tgagtctagc 

ttgggctcta 

cccggttcct 

ccagcctggt 

acacctgtgt 

acaccaggga 

ctcaatctac 

ccttccgggc 

gcttcgacta 

ggtggctggc 

tggtggtctt 

tctgggtcct 

ttttcagctt 

agctggactg 

ggcctccctt 

acatgaagga 

ccttggagtc 

tctacctcct 

agctgcctgt 

tgaagccctc 

cccttctgtt 

ttctttggcc 

ctgggattgg 



gatggccggc 

ctcccagggc 

ctgctctggg 

aggctggacc 

cctccaggaa 

gttctttcaa 

gatgctctgc 

ggccttcgcc 

cactgaccta 

ctgtgctgcg 

tctcctgctg 

tggctacaac 

ctggtgcctg 

gctgctgcag 

ggatgcccat 

tctggaagat 

agaccttgga 

ctcccctcaa 

tgtgggccca 

tgttctaggg 

ggactgaact 

ttcctcccca 

tgtctacctg 

accacccccc 

agccaaggtt 

gaaggagttt 



ctggcggcgc 

gaccgtgagc 

ggcgctctga 

tgtcgggacg 

ggtcacaaag 

gagccggcat 

cgctaccgca 

tgggtgtccc 

cagagaaaat 

tcaggaccgt 

ctcatgctga 

ctggtggcca 

tggaaccagc 

gggctgtccc 

gccatctggc 

gacagcctgt 

gcgaagtctg 

cccttgagat 

gaatcatgtg 

aaggcctccc 

ggggtggaac 

ccagcctcct 

ggagaccagg 

accctcctcc 

cacggcgatt 

caccctgacc 



ggttggtcct 

cggtgtaccg 
atcacttccg 
actgtaagta 
tgcctcagtt 
cggccgtggc 
ccttcgtgcc 
tcaatgcatg 
ggactacttc 

ggggctgcag 

ccgtgcacgt 
acgtggctat 
ggcggctgcc 
tgctcgagct 
acatcagcac 
acctgctgaa 
ccccagtggg 
gattttctct 
gccagcccac 
agcatctggg 
tgagtgtgtt 
ccccacatcc 
gtaccacagg 
aggacaccac 
ctccccatgg 
gttgccctag 



gctagctggg 

cgactgcgta 

ctcccgccag 

tgagtgtatg 

ccatggcaag 

ctcgtttctc 

agcctcctcc 

gttctggtcc 

tgtgcctcct 

cacccagctg 

ctcctacctg 

tggcctggtc 

tcacgtgcgc 

gcttgacttc 

catccctgtc 

ggaatcagag 

gatcctgccc 

tttcaacttc 

cccctgttgg 

actcgagagt 

cttagctcta 

ccagctgcct 

ccttagggat 

taggtggtgc 

gatcttgagg 

ccaggttccc 



gcagcggcgc 

ctgcagtgcg 

ccaatctaca 

tgggtcaccg 

tggcccttct 

aatggcctgg 

cccatgtacc 

acagtcttcc 

gtatcctaca 

tggtcaagtg 

agcctcatcc 

aacgtggtgt 

aagtgcgtgg 

ccaccgctct 

cacgtcctct 

gacaagttca 

ccgccctgct 

ttgaacttgg 

ccctcaccag 

gggcagcccc 

ccgggaggac 

ggctgggtcc 

acagggggtc 

tggatgcttg 

gaccaagctg 

aggaggcctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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76- 



accatactcc 
gtctggttga 
acagggccgt 
gacgggtgtt 
gttagcgtgg 
gccggtgaat 
aataatcact 
ctctccatgg 
ctgcaaacct 
gaggaaggaa 
atggctggcc 
tgattttggg 
tctggtttct 
cacactatgc 
ttcctgcccc 
ctccgtggtt 
ggaggggagc 
ttgatactga 
ttcaaaaagc 
gggtggggta 
<210> 64 



ctttcagggc 
gagcctgcca 
gagcatgggc 
gtcggggaag 
gttaggggaa 
gtgtccacag 
tgtggagcgc 
ccaggctgcc 
cacagggtcc 
aatggggatg 
ttcccctcca 
gaggaggaag 
tcccttccca 
ctgcgccctg 
acaggaatag 
gaagcagact 
tatgctagga 
aaacttttaa 
ttctcgagag 
cccaggtaag 



cagggctcca 
ccgtgtgtcg 
ctgggtgtgt 
aggtgtggct 
cgtgtgtgcg 
ttgagaggtt 
cagctctgcc 
tgtgtgcatg 
ccacacaaca 
gctggggctc 
aaacctccat 

gggcgatttg 
gagggtctta 
gtaaaggtga 
aatggaggga 
ggatttttgc 
ctccaacctc 
ggtgggaggg 
tacttctaga 
tgtnccccat 



gcaagcccag 
ggagtgtggg 
gtgagctcag 
tcaaagtgtg 
cgtgctggtg 
ggagcaggat 
caagacgcca 
ttccctgtct 
gtgccctcca 
tctccatcct 
tcccctgctg 
agggagaagg 
ctgttccagg 
cccctgccat 
gctccagaaa 
tctgcccctg 
agggactcgg 
tggcaaggga 
gcggccgcgg 
ate 



ggcaaggatc 
ccaggctgag 
gcctaggtgc 
tgtgtgcagg 
ggcatgtgag 
gagggaatcc 
cctgggcgga 
ggtgcccctt 
gaagcagccc 
ccttttctcc 
ccagcccctt 
ggagaaagct 
gtggccccag 
ttaccagcag 
ctttccatcc 
accccttgtc 
gtggcctgcg 
tgtgcttaag 
gcccatcgat 



ctgtgctgct 
tgcataggtg 
gcagtgtgga 
gggtgggtgt 
atgagtgact 
tgtcaccatc 
cagecaggag 
tgcccgcctc 
cteggaggea 
ttgccttcgc 
tgccatagcc 
tatggctggg 
gcagcagggc 
ccctggcatg 
caaaggcagt 
cctctttgag 
ctagcttctt 
cggccgcgaa 
tttnccaccc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2733 



<211> 2546 

<212> DNA 

<213> Homo sapiens 



<400> 64 

aagctcctcc 

tcctgctagc 

accgcgactg 

tccgctcccg 

agtatgagtg 

agttccatgg 

tggcctcgtt 

tgccagcctc 

catggttctg 

acttctgtgc 

aacgtggtgt 

aagtgcgtgg 

ccaccgctct 

cacgtcctct 

gacaagttca 

ccgccctgct 

ttgaacttgg 

ccctcaccag 

gggcagcccc 

acegggagga 

tggctgggtc 

acagggggtc 

tggatgcttg 

gaccaagctg 

aggaggcetc 

ctgtgctgct 

tgcataggtg 

gcagtgtgga 

gggtgggtgt 

atgagtgact 

tgtcaccatc 

cagecaggag 

tgcccgcctc 

cteggaggea 

ttgccttcgc 



cccggcggcg 

tggggcagcg 

cgtactgcag 

ccagccaatc 

tatgtgggtc 

caagtggccc 

tctcaatggc 

ctcccccatg 

gtccacagtc 

ctccactgtc 

ggtggctggc 

tggtggtctt 

tctgggtcct 

ttttcagctt 

agctggactg 

ggcctccctt 

acatgaagga 

ccttggagtc 

tctacctcct 

cagctgcctg 

ctgaagccct 

cccttctgtt 

ttctttggcc 

ctgggattgg 

accatactcc 

gtctggttga 

acagggcegt 

gacgggtgtt 

gttagcgtgg 

gccggtgaat 

aataatcact 

ctctccatgg 

ctgcaaacct 

gaggaaggaa 

atggctggcc 



agecagggag 

gcgctggcga 

tgegaagage 

tacatgagtc 

accgttgggc 

ttctcccggt 

ctggccagcc 

taccacacct 

ttccacacca 

atcctacact 

ctggtgcctg 

getgetgeag 

ggatgeccat 

tctggaagat 

aagaccttgg 

ctcccctcaa 

tgtqggccca 

tgttctaggg 

ggagctgaac 

tttcctcccc 

ctgtctacct 

accacccccc 

agecaaggtt 

gaaggagttt 

ctttcagggc 

gagcctgcca 

gagcatgggc 

gtcggggaag 

gttaggggaa 

gtgtccacag 

tgtggagcgc 

ccaggctgcc 

cacagggtcc 

aatggggatg 

ttcccctcca 



aaaggatggc 

gcggctccca 

agaactgetc 

tagcaggctg 

tctacctcca 

tcctgttctt 

tggtgatgct 

gtgtggcctt 

gggacactga 

caatctacct 

tggaaccagc 

gggctgtccc 

gccatctggc 

gaeagectgt 

agegagtctg 

cccttgagat 

gaatcatgtg 

aaggcctccc 

tggggtggaa 

accagcctcc 

gggagaccag 

accctcctcc 

caeggegatt 

caccctgacc 

cagggctcca 

ccgtgtgtcg 

ctgggtgtgt 

aggtgtggct 

cgtgtgtgcg 

ttgagaggtt 

cagctctgcc 

tgtgtgcatg 

ccacacaaca 

gctggggctc 

aaacctccat 



cggcctggcg 

gggcgaccgt 

tgggggeget 

gaectgtegg 

ggaaggtcac 

teaagagecg 

ctgccgctac 

cgcctgggtg 

cctcacagag 

gtgctgcgtc 

ggcggctgcc 

tgetcgaget 

acatcagcac 

acctgetgaa 

ccccagtggg 

gattttctct 

gccagcccac 

agcatctggg 

ctgagtgtgt 

tccccacatc 

ggaccacagg 

aggacaccac 

ctccccatgg 

gttgecctag 

gcaagcccag 

ggagtgtggg 

gtgagctcag 

tcaaagtgtg 

cgtgctggtg 

ggagcaggat 

caagacgcca 

ttccctgtct 

gtgccctcca 

tctccatcct 

tcccctgctg 



gcgcggttgg 

gagccggtgt 

ctgaatcact 

gacgactgta 

aaagtgcctc 

gcatcggccg 

cgcaccttcg 

tccctcaatg 

aaaatggact 

aggcctggtc 

tcacgtgcgc 

gcttgacttc 

catccctgtc 

ggaatcagag 

gatcctgccc 

tttcaacttc 

cccctgttgg 

actcgagagt 

tcttagctct 

cccagctgcc 

ccttagggat 

taggtggtgc 

gatcttgagg 

ccaggttccc 

ggcaaggatc 

ccaggctgag 

gcctaggtgc 

tgtgtgcagg 

ggcatgtgag 

gagggaatcc 

cctgggcgga 

ggtgcccctt 

gaagcagccc 

ccttttctcc 

ccagcccctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 

9 OCT 

960 
1020 
1080 
1140 
1200 
1260 
1320' 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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tgccatagcc 
tatggctggg 
ggcaggcagg 
cagccctggc 
tcccaaaggc 
gtccctcttt 
gcgctagctt 
aataaatcaa 
<210> 65 



tgattttggg 
tctggtttct 
ggccacacta 
atgttcctgc 
agtctccgtg 
gagggagggg 
cttttgatac 
ttccaagcct 



gaggaggaag 
tcccttccca 
tgcctgcgcc 
cccacaggaa 
gttgaagcag 
agctatgcta 
tgaaaacttt 
cacctg 



gggcgatttg 
gagggtctta 
ctggtaaagg 
tagaatggag 
actggatttt 
ggactccaac 
taaggtggga 



agggagaagg 
ctgttccagg 
tgacccctgc 
ggagctccag 
tgctctgccc 
ctcagggact 
gggtggcaag 



ggagaaagct 
gtggccccag 
catttaccag 
aaactttcca 
ctgacccctt 
cgggtggcct 
ggatgtgctt 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2546 



<211> 2683 

<212> DNA 

<213> Homo sapiens 



<400> 65 

aagctcctcc 

tcctgctagc 

accgcgactg 

tccgctcccg 

agtatgagtg 

agttccatgg 

tggcctcgtt 

tgccagcctc 

catggttctg 

acttctgtgc 

ggctgcagca 

tgcacgtctc 

tggctattgg 

ggctgcctca 

tcgagctgct 

tcagcaccat 

tgctgaagga 

cagtggggat 

tttctctttt 

agcccacccc 

atctgggact 

agtgtgttct 

ccacatcccc 

ccacaggcct 

acaccactag 

cccatgggat 

gccctagcca 

agcccagggc 

gtgtgggcca 

agctcaggcc 

aagtgtgtgt 

gctggtgggc 

gcaggatgag 

gacgccacct 

cctgtctggt 

ccctccagaa 

ccatcctcct 

cctgctgcca 

gagaagggga 

ttccagggtg 

cccctgccat 

gctccagaaa 

tctgcccctg 

agggactcgg 

tggcaaggga 

<210> 66 



cccggcggcg 
tggggcagcg 
cgtactgcag 
ccagccaatc 
tatgtgggtc 
caagtggccc 
tctcaatggc 
ctcccccatg 
gtccacagtc 
ctccactgtc 
cccagctgtg 
ctacctgagc 
cctggtcaac 
cgtgcgcaag 
tgacttccca 
ccctgtccac 
atcagaggac 
cctgcccccg 
caacttcttg 
ctgttggccc 
cgagagtggg 
tagctctacc 
agctgcctgg 
tagggataca 
gtggtgctgg 
cttgagggac 
ggttcccagg 
aaggatcctg 
ggctgagtgc 
taggtgcgca 
gtgcaggggg 
atgtgagatg 
ggaatcctgt 
gggcggacag 
gcccctttgc 
gcagcccctc 
tttctccttg 
gcccctttgc 
gaaagcttat 
gccccagggc 
ttaccagcag 
ctttccatcc 
accccttgtc 
gtggcctgcg 
tgtgcttaat 



agccagggag 
gcgctggcga 
tgcgaagagc 
tacatgagtc 
accgttgggc 
ttctcccggt 
ctggccagcc 
taccacacct 
ttccacacca 
atcctacact 
gtcagtgcct 
ctcatccgct 
gtggtgtggt 
tgcgtggtgg 
ccgctcttct 
gtcctctttt 
aagttcaagc 
ccctgctggc 
aacttggaca 
tcaccagcct 
cagcccctct 
gggaggacag 
ctgggtcctg 

gggggtcccc 

atgcttgttc 
caagctgctg 
aggcctcacc 
tgctgctgtc 
ataggtgaca 

gtgtggagac 

tgggtgtgtt 
agtgactgcc 
caccatcaat 
ccaggagctc 
ccgcctcctg 
ggaggcagag 
ccttcgcatg 
catagcctga 
ggctgggtct 
aggcaggggc 
ccctggcatg 
caaaggcagt 
cctctttgag 
ctagcttctt 
aaatcaattc 



aaaggatggc 
gcggctccca 
agaactgctc 
tagcaggctg 
tctacctcca 
tcctgttctt 
tggtgatgct 
gtgtggcctt 
gggacactga 
caatctacct 
tccgggctct 
tcgactatgg 
ggctggcctg 
tggtcttgct 
gggtcctgga 
tcagctttct 
tggactgaag 
ctcccttctc 
tgaaggatgt 
tggagtctgt 
acctcctgga 
ctgcctgttt 
aagccctctg 
ttctgttacc 
tttggccagc 
ggattgggaa 
atactccctt 
tggttgagag 
gggccgtgag 
gggtgttgtc 
agcgtgggtt 
ggtgaatgtg 
aatcacttgt 
tccatggcca 
caaacctcac 
gaaggaaaat 
gctggccttc 
ttttggggag 
ggtttcttcc 
cacactatgc 
ttcctgcccc 
ctccgtggtt 

ggaggggagc 

ttgatactga 
caagcctcac 



cggcctggcg 
gggcgaccgt 

tgggggcgct 
gacctgtcgg 
ggaaggtcac 
tcaagagccg 
ctgccgctac 
cgcctgggtg 
cctcacagag 
gtgctgcgtc 
cctgctgctc 
ctacaacctg 
gtgcctgtgg 
gctgcagggg 
tgcccatgcc 
ggaagatgac 
accttggagc 
ccctcaaccc 
gggcccagaa 
tctagggaag 
gctgaactgg 
cctccccacc 
tctacctggg 
accccccacc 
caaggttcac 
ggagtttcac 
tcagggccag 
cctgccaccg 
catgggcctg 
ggggaagagg 
aggggaacgt 
tccacagttg 
ggagcgccag 
ggctgcctgt 
agggtcccca 
ggggatggct 
ccctccaaaa 
gaggaagggg 
cttcccagag 
ctgcgccctg 
acaggaatag 
gaagcagact 
tatgctagga 
aaacttttaa 
ctg 



gcgcggttgg 
gagccggtgt 
ctgaatcact 
gacgactgta 
aaagtgcctc 
gcatcggccg 
cgcaccttcg 
tccctcaatg 
aaaatggact 
aggaccgtgg 
atgctgaccg 
gtggccaacg 
aaccagcggc 
ctgtccctgc 
atctggcaca 
agcctgtacc 
gagtctgccc 
ttgagatgat 
tcatgtggcc 
gcctcccagc 
ggtggaactg 
agcctcctcc 
agaccaggga 
ctcctccagg 
ggcgattctc 
cctgaccgtt 
ggctccagca 
tgtgtcggga 
ggtgtgtgtg 
tgtggcttca 
gtgtgcgcgt 
agaggttgga 
ctctgcccaa 
gtgcatgttc 
cacaacagtg 
ggggctctct 
cctccattcc 
cgatttgagg 
ggtcttactg 
gtaaaggtga 
aatggaggga 
ggatttttgc 
ctccaacctc 
ggtgggaggg 



60 
120 
180 
240 
300 
360 
420 
480 
.540 
600 
660 
720 
780 
840 
. 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2683 
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<211> 2341 

<212> DNA 

<213> Homo sapiens 



78 



<400> 66 
aagctcctcc 
tcctgctagc 
accgcgactg 
tccgctcccg 
agtatgagtg 
agttccatgg 
tggcctcgtt 
tgccagcctc 
catggttctg 
acttctgtgc 
aagatgacag 
cttggagcga 
ctcaaccctt 
gcccagaatc 
tagggaaggc 
tgaactgggg 
tccccaccag 
tacctgggag 
cccccaccct 
aggttcacgg 
agtttcaccc 
agggccaggg 
tgccaccgtg 
tgggcctggg 
ggaagaggtg 
gggaacgtgt 
cacagttgag 
agcgccagct 
ctgcctgtgt 
ggtccccaca 
ggatggctgg 
ctccaaaacc 
ggaaggggcg 
tcccagaggg 
gcgccctggt 
aggaatagaa 
agcagactgg 
tgctaggact 
acttttaagg 
g 

<210> 67 
<211> 2109 



<212> DNA 

<213> Homo sapiens 



cccggcggcg 
tggggcagcg 
cgtactgcag 
ccagccaatc 
tatgtgggtc 
caagtggccc 
tctcaatggc 
ctcccccatg 
gtccacagtc 
ctccactgtc 
cctgtacctg 
gtctgcccca 
gagatgattt 
atgtggccag 
ctcccagcat 
tggaactgag 
cctcctcccc 
accagggacc 
cctccaggac 
cgattctccc 
tgaccgttgc 
ctccagcaag 
tgtcgggagt 
tgtgtgtgag 
tggcttcaaa 
gtgcgcgtgc 
aggttggagc 
ctgcccaaga 
gcafcgttccc 
caacagtgcc 
ggctctctcc 
tccattcccc 
atttgaggga 
tcttactgtt 
aaaggtgacc 
tggagggagc 
atttttgctc 
ccaacctcag 
tgggagggtg 



agccagggag 
gcgctggcga 
tgcgaagagc 
tacatgagtc 
accgttgggc 
ttctcccggt 
ctggccagcc 
taccacacct 
ttccacacca 
atcctacact 
ctgaaggaat 
gtggggatcc 
tctcttttca 
cccaccccct 
ctgggactcg 
tgtgttctta 
acatccccag 
acaggcctta 
accactaggt 
catgggatct 
cctagccagg 
cccagggcaa 
gtgggccagg 
ctcaggccta 
gtgtgtgtgt 
tggtgggcat 
aggatgaggg 
cgccacctgg 
tgtctggtgc 
ctccagaagc 
atcctccttt 
tgctgccagc 
gaaggggaga 
ccagggtggc 
cctgccattt 
tccagaaact 
tgcccctgac 
ggactcgggt 
gcaagggatg 



aaaggatggc 

gcggctccca 

agaactgctc 

tagcaggctg 

tctacctcca 

tcctgttctt 

tggtgatgct 

gtgtggcctt 

gggacactga 

caatctacct 

cagaggacaa 

tgcccccgcc 

acttcttgaa 

gttggccctc 

agagtgggca 

gctctaccgg 

ctgcctggct 

gggatacagg 

ggtgctggat 

tgagggacca 

ttcccaggag 

ggatcctgtg 

ctgagtgcat 

ggtgcgcagt 

gcagggggtg 

gtgagatgag 

aatcctgtca 

gcggacagcc 

ccctttgccc 

agcccctcgg 

tctccttgcc 

ccctttgcca 

aagcttatgg 

cccagggcag 

accagcagcc 

ttccatccca 

cccttgtccc 

ggcctgcgct 

tgcttaataa 



cggcctggcg 
gggcgaccgt 

tgggggcgct 

gacctgtcgg 
ggaaggtcac 
tcaagagccg 
ctgccgctac 
cgcctgggtg 
cctcacagag 
gtgctgcgtc 
gttcaagctg 
ctgctggcct 
cttggacatg 
accagccttg 
gcccctctac 
gaggacagct 
gggtcctgaa 
gggtcccctt 
gcttgttctt 
agctgctggg 
gcctcaccat 
ctgctgtctg 
aggtgacagg 
gtggagacgg 

ggtgtgttag 

tgactgccgg 
ccatcaataa 
aggagctctc 
gcctcctgca 
aggcagagga 
ttcgcatggc 
tagcctgatt 
ctgggtctgg 
gcaggggcca 
ctggcatgtt 
aaggcagtct 
tctttgaggg 
agcttctttt 
atcaattcca 



gcgcggttgg 

gagccggtgt 

ctgaatcact 

gacgactgta 

aaagtgcctc 

gcatcggccg 

cgcaccttcg 

tccctcaatg 

aaaatggact 

agctttctgg 

gactgaagac 

cccttctccc 

aaggatgtgg 

gagtctgttc 

ctcctggagc 

gcctgtttcc 

gccctctgtc 

ctgttaccac 

tggccagcca 

attgggaagg 

actccctttc 

gttgagagcc 

gccgtgagca 

gtgttgtcgg 

cgtgggttag 

tgaatgtgtc 

tcacttgtgg 

catggccagg 

aacctcacag 

aggaaaatgg 

tggccttccc 

ttggggagga 

tttcttccct 

cactatgcct 

cctgccccac 

ccgtggttga 

aggggagcta 

gatactgaaa 

agcctcacct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780^ 

840^ 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 

216(A 

2220^ 

2280 
2340 
2341 



<400> 67 

gattcggccg gagctgccag cggggaggct gcagccgcgg gttgttacag ctgctggagc 60 

agcagcggcc cccgctcccg ggaaccgttc ccgggccgtt gatcttcggc cccacacgaa 12 0 

cagcagagag gggcagcagg atgaatgtgg gcacagcgca cagcgaggtg aaccccaaca 180 

cgcgggtgat gaacagccgt ggcatctggc tctcctacgt gctggccatc ggtctcctcc 240 

acatcgtgct gctgagcatc ccgtttgtga gtgtccctgt cgtctggacc ctcaccaacc 300 

tcattcacaa catgggcatg tatatcttcc tgcacacggt gaaggggaca ccctttgaga 3 60 
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ccccggacca 

agttcacggc 

gcttctacac 

tgcttatccc 

gagagtgcag 

gctgcaacac 

cctctggcct 

gggttggggt 

ctttttgaat 

atggaccacg 

caggcagcca 

tgtttcactg 

agcaagtgtg 

gcttttggca 

ttggggaaag 

gctgccatcc 

agaagaatgg 

aagctagacc 

tttacccaca 

ttggctgttg 

cttggccctg 

tcctctgcag 

catgtggcag 

ctgtaggatc 

tctcagcctg 

cctccccagt 

ttccaggggt 

catctcccac 

gaataggatt 

gtttctgtt 

<210> 68 



gggcaaggcg 
ctctcggaag 
taagtacgac 
caagctgccc 
ccccttcccc 
tgaagacaga 
aatttccccc 
aggccccctg 
agagactcca 
cagttgcgac 
aagcacttta 
tgattcctag 
ggatgggtct 
gcttctctgc 
atggggtaga 
tggcctggtg 
ggccacatag 
caatctcctt 
gaatgtgata 
ggtagacagg 
actcctgcgg 
gccctcaccc 
ccacatgtgg 
cctgctcaag 
cagagaagaa 
tggactaggg 
ccaagggaac 
cctzgtgtatc 
tttactccct 



aggctgctaa 
ttcttgacca 
cagatccatt 
cagctccacg 
tgcccagggt 
aagaagaagc 
ctcgcttccc 
ggctctgacc 
tggagttggt 
aggaggccag 
acccctgcat 
gttttttcga 
ttgcctttct 
aacccaccgt 
gtgtagatgg 
gaggttgggg 
cagcagagga 
tgggatgtgg 
gtaatagggg 
tggggaaaag 
gggtatgggg 
cttaacttcc 
cgttgtgagg 
cagcacagag 
agtgaggccg 
gcagtgttaa 
aggagaggtc 
atagggaact 
ttgtacagta 



cccactggga 
tcacacccat 
ttgtgctcaa 
gagtccggat 
ggcaggggag 
ctctggacac 
ccagtagcca 
ttttctgaat 
catggaatgg 
gggaaaaacc 
agggagcaga 
tgccatgcag 
gggtagggag 
gggtcctggt 
cttggttcca 
agctgtagga 
ctggtgtaag 
gcagggaggg 
agggaggctg 
gcccgtgagt 
aagctgtgac 
tcatgcagac 
ccaccccatg 
gaaggggcaa 
ggagcctgag 
ttttgaaaag 
actgggcctg 
ttcaccttaa 
ttctgaggaa 



gcagatggat 
cgtgctgtac 
caccgtgtcc 
ttttggaatc 
gggtagggta 
tgccagagat 
acttggagta 
tttttgatct 
gctgggctcc 
cctgctcact 

gggcggtacg 

tgtgtgcttt 
ctgtctaatc 
tgggagtggg 
gaggtgaggg 
gagctagtga 
ggagggaggg 
aagcaggctt 
ctgtgggttt 
cattgtaagc 
agaaacgatg 
tggcactggg 
tggggtctgt 
gacgtggcct 
cctgggctgg 
gtgtgggtcc 
ttttctccct 
aatctttcta 
cgcaaataaa 



tatggggtcc 
ttcctcacca 
ctgatgagcg 
aataagtact 
aaaggcatgt 
gggggttgag 
gctfcgtagtg 
cttccttttg 
tgggctgaac 
tgtttgccct 
gcttctggat 
tgtgtatgga 
caagtcccag 
gagggtcagg 
ggccagggct 
gtcgagactt 
gtagggacag 
ggagggttaa 
aactcctggg 
acaggtccaa 
ggtgctgtgg 
cagggcctct 
ggtgagagtc 
gtaggcactg 
agccttctcc 
ctgtgtcctt 
cctgaccctg 
agcaaagtgt 
agggcaacat 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2109 



<211> 2423 . 

<212> DNA 

<213> Homo sapiens 



<400> 68 

gagagccgag 

gcctagccgg 

gcgcccagtc 

gctcgccgca 

gagtgaccaa 

aacttcccaa 

gctgtgctca 

gaattgacct 

aacgatgtgg 

atgcattaag 

gtacaaagac 

accttgactt 

gatgtccact 

ttcaagcact 

agctagaaga 

acttgcagac 

ataagttaca 

ctctaggtca 

cagatgtggg 

aagagtgtgt 

ttcaagtatt 

ggaatggggc 

tcacagatgc 

atgactgcca 

ttaaagtcca 



ctagcgacga 
agccgagagg 
cccgtcccgg 
gcgcccggcc 
gagcaggttt 
agaactcctg 
ggtctccagg 
atttgatttc 
gggcttttta 
aacctttgca 
aacagacgct 
ggcttcctgt 
gttggagcag 
agtgaggggc 
tgaagctctc 
ttgcttgcaa 
atccctttgt 
gaactgccca 
ctttaccact 
tcagataaca 
gagtctgtct 
ctgcgcccat 
atccctggag 
gcaaatcaca 
cgcctacttc 



gcagtcgttg 
tctcttgttc 
aactcccggg 
cgggccgcac 
gagatgttct 
ttacggatat 
gcctggaatg 
cagagggata 
cgaaagttaa 
caaaactgca 
acatgtacta 
acatcaataa 
ttgaacattt 
tgtgggggtc 
aagtacatag 
atcacagatg 
gcctctggct 
cggcttagaa 
ctagccagga 
gatagcacat 
cactgtgagc 
gaccagctgg 
cacttgaaga 
cgggctggaa 
gcacctgtca 



cggccgccgg 
ccgtcccacg 
cctgtcctgg 
ccgccggccc 
caaatagtga 
tttcttttct 
ttctggctct 
ttgagggccg 
gtcttcgtgg 
ggaacattga 
gccttagcaa 
caaacatgtc 
cctggtgtga 
tcaaggcctt 
gtgcacactg 
aaggtctcat 
gctccaacat 
tattggaagt 
attgccatga 
taatccaact 
tgatcacaga 
a 99tgattga 
gctgtcatag 
tcaagagact 
ctccaccccc 



cgccgcggga 
gtcccggcgt 
gcccccggtc 
catgaggagg 
tgaagctgta 
agatgttgtt 
ggatggcagt 
agtagtggag 
atgtcttgga 
agtactgaat 
gttctgttcc 
tctaaaagct 
ccaagtaacc 
attcttaaaa 
ccctgaactg 
tactatatgc 
cacagatgcc 
ggcaagatgt 
acttgaaaag 
ttctatacac 
tgatggaatt 
gctggacaac 
ccttgagcgg 
caggacccat 
atcagtaggg 



ggtggtggag 
cacccctccg 
tgtgcactcc 
gacgtgaacg 
atcaataaaa 
accctgtgcc 
aactggcagc 
aatatttcaa 
gtgggagaca 
ctaaatgggt 
aaactcaggc 
ctgagtgagg 
aaggatggca 
ggctgcacgc 
gtgactttga 
agagggtgcc 
atcctgaatg 
tctcaattaa 
atggacctgg 
tgtcctcgac 
cgtcacctgg 
tgcccactaa 
atagaactct 
ttacccaata 
ggcagcagac 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



BHC03 1 051 



-80- 



agcgcttctg cagatgctgc atcatcctat gacaatggag gtggtcaacc ttggcgaact 1560 

gagtatttaa tgacacttct agagctaccg tggagtctct ccagtggaag caaccccagt 1620 

gttctgagca agggttacaa agtgagggag ggcagtgtcc agatccccag agccacacat 1680 

acatacacat acacaccctt acccccatcc actctagctt tgtgaccatg ggactgaagt 1740 

ttgtgatggc ttttttatca agtagattgg taaaatttaa ccattcctgt tgaggtgccc 1800 

ataagaaaat cataggccaa gatagggagg ggcattccag caaaccccgt gttaatgcta 1860 

ctgtggtttt taaatttttg tctaggggtt tctttgggga ttttagaaca gcatctgctg 1920 

tcctccgggg tcaagaaaag catggaaaga caatatatga tgtacccagg gaccagaaag 1980 

aaaatttctt tgcatcttag aaatggtaga cattcattgt gactaaagag cttctatgct 2040 

tccttgtttc catgccaaca tgctgagcat gctcacaaag aaggctcgtc cattcctcct 2100 

gtgttttagt atttggccca gaggtttcct aaatggttgc cttgaaatca ctgtggtcca 2160 

aatgtaattc ttacacactc aaattatcac tgtctgtagc acacttgtgc acctgtctta 2220 

cattctctgt tgctcccccc cacactcttg ctcagtctgt cacctgttca gtctgcttac 2280 

tcactcaatt gttacccttt tgctgttgtc gtgtttacag tttgcatttt gaatgattag 2340 

ttgggattac caaacatttt ttaaaaagat attatcaata aatatttttt taattctaaa 2400 

ttttaaaaaa aaaaaaaaaa aaa -• 2423 
<210> 69 



<211> 1841 



<212> DNA 

<213> Homo sapiens 



<400> 69 ... 

agctgggacc ggagggtgag cccggcagag gcagagacac acgcggagag gaggagaggc 60 

t 9 a 999 a ggg aggtggagaa ggacgggaga ggcagagaga ggagacacgc agagacactc 120 

aggaggggag agacaccgag acgcagagac actcaggagg ggagagacac cgagacgcag 180 

agacacccag gccggggagc gcgagggagc gaggcacaga cctggctcag cgagcgcggg 240 

gggcgagccc cgagtcccga gagcctgggg gcgcgcccag cccgggcgcc gaccctcctc 300 

ccgctcccgc gccctcccct cggcgggcac ggtattttta tccgtgcgcg aacagccctc 360 

ctcctcctct cgccgcacag cccgccgcct gcgcggggga gcccagcaca gaccgccgcc 420 

gggaccccga gtcgcgcacc ccagccccac cgcccacccc gcgcgccatg gaccccaagg 4 80 

accgcaagaa gatccagttc tcggtgcccg cgccccctag ccagctcgac ccccgccagg 540 

tggagatgat ccggcgcagg agaccaacgc ctgccatgct gttccggctc tcagagcact 600 

cctcaccaga ggaggaagcc tccccccacc agagagcctc aggagagggg caccatctca 660 

agtcgaagag acccaacccc tgtgcctaca caccaccttc gctgaaagct gtgcagcgca 720 

ttgctgagtc tcacctgcag tctatcagca atttgaatga gaaccaggcc tcagaggagg 780 
aggatgagct gggggagctt cgggagctgg gttatccaag agaggaagat gaggaggaag 840 
aggaggatga tgaagaagag gaagaagaag aggacagcca ggctgaagtc ctgaaggtca 900 

tcaggcagtc tgctgggcaa aagacaacct gtggccaggg tctggaaggg ccctgggagc 960 

gcccaccccc tctggatgag tccgagagag atggaggctc tgaggaccaa gtggaagacc 102^^ 

cagcactaag tgagcctggg gaggaacctc agcgcccttc cccctctgag cctggcacat 108^p 

aggcacccag cctgcatcfcc ccaggaggaa gtggagggga catcgctgtt ccccagaaac 1140 

ccactctatc ctcaccctgt tttgtgctct tcccctcgcc tgctagggct gcggcttctg 12 00 

acttctagaa gactaaggct ggtctgtgtt tgcttgtttg cccacctttg gctgataccc 1260 

agagaacctg ggcacttgct gcctgatgcc cacccctgcc agtcattcct ccattcaccc 1320 

agcgggaggt gggatgtgag acagcccaca ttggaaaatc cagaaaaccg ggaacaggga 13 80 

tttgcccttc acaattctac tccccagatc ctctcccctg gacacaggag acccacaggg 1440 

caggacccta agate tgggg aaaggaggtc ctgagaacct tgaggtaccc ttagatcctt 1500 

ttctacccac tttcctatgg aggattccaa gtcaccactt ctctcaccgg cttctaccag 1560 

ggtccaggac taaggcgttt ttctccatag cctcaacatt ttgggaatct tcccttaatc 1620 

acccttgctc ctcctgggtg cctggaagat ggactggcag agacctcttt gttgcgtttt 1680 

gtgetttgat gecaggaatg ccgcctagtt tatgtccccg gtggggcaca cagegggggg 1740 

cgccaggttt tccttgtccc ccagctgctc tgcccctttc cccttcttcc ctgactccag 1800 

gcctgaaccc ctcccgtgct gtaataaatc tttgtaaata a 1841 
<210> 70 



<211> 748 



<212> DNA 



<213> Homo sapiens 



BHC03 1 051 



-81 - 



<400> 70 

ggccgcgatg 

gccgggcagt 

ctacctggag 

cctcgggggc 

ggagaatggg 

tggagaaacc 

caggactctg 

tgggagctcc 

actttgccct 

agacactctc 

acgaaagccc 

attctcccct 

aaaaaaaaaa 

<210> 71 



agcggggagc 

ggggtccgca 

ctggccagtg 
acaggtgcct 
ggctttccct 
ctagaaaaga 
ggttcctgct 
ccctgcctct 
ttgggtacaa 
catgaacact 
agtccactct 
aacccagggc 
aaaaaaaaaa 



c ggggcagac 

tcgtggtgga 
ctgtgaagga 
ttgagataga 
atgagaaaga 
tcaccaacag 
ctgttctggg 
ttcccctact 
agaaggaata 
tctccagcca 
tcgcctcggt 
aatgtcagct 
aaaaaaaa 



gtccgtagcg 
gtactgtgaa 
gcagtatccg 
gataaatgga 
tctcattgag 
ccgtcctccc 
gtccaaacct 
tagctcctta 
gaagattccg 
cctcataccc 
aatacctgtc 
attggcagta 



ccccctcccg 
ccctgcggct 
ggcatcgaga 
cagctggtgt 
gccatccgaa 
tgcgtcatcc 
tggtctccct 
gcaaagagac 
tggccttggg 
ccttcccagg 
tgatgccaca 
aagtggcgct 



aggaggtcga 
tcgaggcgac 
tcgagtcgcg 
tctccaagct 
gagccagtaa 
tgtgactgca 
ttggtcctgc 
cctggcctcc 
ggcaggagag 
gtaagtgccc 
gattttattt 
acaaacacta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
748 



<211> 795 
<212> DNA 
<213> Homo sapiens 



<400> 71 

tacggctgcg agaagacgac agaagctaga cccaatctcc tttgggatgt gggcagggag 60 

ggaagcaggc ttggagggtt aatttaccca cagaatgtga tagtaatagg ggagggaggc 120 

tgctgcgggt ttaactcctg ggttggctgt tgggtagaca ggtggggaaa aggcccgtga 180 

gtcattgtaa gcacaggtcc aacttggccc tgactcctgc gggggtatgg ggaagctgtg 240 

acagaaacga tgggtgctgt ggtcctctgc aggccctcac cccttaactt cctcatacag 3 00 

actggcactg ggcagggcct ctcatgtggc agccacatgt ggcgttgtga ggccacccca 360 

tgtggggtct gtggtgagag tcctgtagga tccctgctca agcagcacag aggaaggggc ,420 

aagacgtggc ctgtaggcac tgtctcagcc tgcagagaag aaagtgaggc cgggagcctg • 4 80 

agcctgggct ggagccttct cccctcccca gttggactag gggcagtgtt aattttgaaa 540 

aggtgtgggt ccctgtgtcc tcttccaggg gtccaaggga acaggagagg tcactgggcc "600 

tgttttctcc ctcctgaccc tgcatctccc accccgtgta tcatagggaa ctttcacctt . 660 

aaaatctttc taagcaaagt gtgaatagga tttttactcc ctttgtacag tattctgaga 720 

aacgcaaata aaagggcaac atgtttctgt taaaaaaaaa aaaaagtacg caaaaaaaaa 780 

aaaaaaaaaa aaaaa 795 
<210> 72 

<211> 2356 

<212> DNA 

<213> Homo sapiens 



<400> 72 

ggcacgaggc cggaagtgac ctctagagcg 
gactagatcc cgcgaggtag cccccgaact 
aaatcttcca gatccgcgta gtgaggaatc 
aatctgaaga agagctggca cccgcagacc 
gagcagaagc atgaggctga gcggaagaag 
gagagagccc gggaagagat gcagcgctat 
gaagaaaagt tggactggat gtaccagggt 
ctgctggggc gccccattga caaatatgtt 
tgctcttctg aaacaggact tctcccaggc 
cttcttgaca tggccagcaa gatccgggag 
gaggagaaaa aacgagaggt attaaataat 
ttgcaaatga gtctggaaaa aaaggagaag 
aagaaacata agcacagaag ctcgagtagt 
gcagggagat cacagaagaa gatggcaaat 



gtggtgaaac tggcagttga cggctcctgg 60 

atttctctac gttttctctt gatcctcccg 120 

gtctccaccg tcatgggggg cggagacctg 180 

ctcaggaatg tggagaaagt gtggaaggcc 240 

attgaggagc ttcagcggga gctgcgagaa 300 

gcggaggatg ttggggccgt caagaaaaaa 360 

cctggtggga tggtgaaccg tgacgagtac 420 

tttgagaaga tggaggagaa ggaggcaggc 480 

tctatctttg ccccatcagg tgccaattcc 540 

gacccactct tcatcatcag gaagaaggag 600 

ccagtgaaaa tgaagaaaat caaagaattg 660 

aagaaaaaga aggagaagaa aaagaagcac 720 

gatcgttcca gcagcgagga tgagcacagt 780 

tcctcccctg ttttgtccaa agtccctgga 840 



BHC03 1 051 



82 



tatggcttac 
gagcaaaaga 
cccccaagac 
cgatccctgg 
aacaggagag 
cgacatgctc 
gagatgatgg 
gctaaggatg 
atccaccgca 
aatatctact 
aaactgtccc 
ttctctttat 
tgaccctgct 
tttgtgggac 
tcggtgggag 
gccacttcct 
cttcagaaac 
tacttgattt 
gacagaatct 
ctttggatgt 
aaaaggttag 
acattgaacc 
tctggctggt 
cactgcttgc 
agaaccaaat 
aaaaaaaaaa 
<210> 73 



aggtccggaa 
gagggcatgg 
atgccagcaa 
gcagaaggtc 
agacaggcca 
ccggatacac 
aaaacgccaa 
aggaacggga 
tgaagctgga 
ctttacagag 
ctctcttatt 
aagagttcaa 
tcattgagtc 
actcagtaac 
tgcttgtgcc 
tcttacctgt 
ctctctggtg 
agaagataat 
ggaaaatcaa 
aaaattgtaa 
ctttgtgaaa 
ttgatggcaa 
ttaggaggag 
agtctccaat 
tgccaatatg 
aaaaaa 



ctctgaccgt 
gatgaagaac 
gaagagcacc 
acggtcccca 
aactaggagc 
cagaaaactc 
atggagggag 
gcagaggcta 
gagtgcatct 
aacttcggta 
ggttttcctg 
atgacttctt 
ctgaaacagc 
tttgggtttt 
actctggaag 
gccaacagac 
tcacccagat 
gtgacagaat 
acaatacaaa 
tgcgtatatg 
ataccttgtt 
gtaatacaat 
cctgggttta 
gtaggcagtg 
ctccatggct 



aaccagggtc 
cattccagat 
agggaagcag 
agacccagca 
ccatcaccta 
tctgcagagg 
gaggagagac 
gagaagctgg 
acttcctccc 
gctctggaga 
cattttccag 
tcacagatgt 
tcacttcctt 
gactctttaa 
gctgttccct 
ctatttcact 
agattgtgct 
gatgtcaggt 
aagccctaaa 
tacaaatgta 
tggtcaatga 
aaggcaggcc 
tcgacgagat 
taaaggtata 
cctaaaggaa 



ttcagggtcc 
ccagaagctc 
ggtcccggga 
aactgcacaa 
aaaaagaggt 
aattagagcg 
tgaacatcct 
actcccggga 
tggaggatcg 
agaactttat 
ggaagctgct 
caaaccacca 
tgagagctag 
cgggtgggca 
ggggttgtga 

gcctcagcgt 
tactgagaca 
taggtcaaag 
tgaactgtta 
caatttttac 
ctttactggg 
agctcgtttt 
ctggagtatc 
gtaaaatgat 
aataaaatgg 



tctgacagca 
ctcccactca 
caggaggtct 
ctctaaggtg 
ctaccaaagg 
aaaacggcaa 
caagaggcat 
tgggaagttc 
ggtgaagcgg 
gaaaagatga 
gaccccttaa 
gtgttcaaag 
tgtgacttgc 
ctggaccatc 
tgtttatcat 
acaccagacc 
aatgaacgtt 
ccaagggagt 
actatttgat 
atgcttttaa 
taatagaacc 
tctctctgaa 
tattcttttc 
tttaggagtc 
aagtttttaa 



900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160^ 
222d^ 
2280 
2340 
2356 



<211> 1646 

<212> DNA 

<213> Homo sapiens 



<400> 73 

gtggaatgtc 

ccatctggat 

cctgacacct 

caaaaactga 

aggaaatcac 

gaagccttgt 

tctttggatg 

gggacaagtg 

ttctggataa 

tttcaggcag 

cccagcagta 

tccgatcaat 

aggaggaaac 

atctcagcta 

gctatcgagt 

cggaggattc 

tgacagagga 

atattcggca 

tggaggaccc 

aagcgcgtgc 

agcagtttgt 

aatctgtcat 

accctgaggc 

ctgaggggcc 

gggttttccc 

ggacctcttt 

catccaaaac 

tgttaaaaaa 

<210> 74 



atcagttaag 
gtatacatgc 
caggctgcca 
aattggcttc 
aagaattgta 
tgatgctgat 
ccggcactac 
gttagatgaa 
tgtagactca 
tttccagggc 
tctggctacc 
taatacgaga 
cctgaaaagc 
taaacacaag 
aaagcagctt 
cagaaacatg 
gaagagaaaa 
ggatctagag 
agacaagcct 
aaaagccatt 
ggctgaggcc 
ggagcagaac 
acgaattctc 
tacctctgfcc 
tttaccagtc 
aaaacaagca 
cagcctggat 
aaaaaaaaaa 



gctattttca 
aggtcacagg 
aatgtggaag 
tgtttctgag 
gttaaggaga 
agattccgct 
acaacaggcc 
ctggattctg 
acgggagagt 
ttccaccatc 
cttgaaaaca 
gaaaacctgt 
gaccggcaat 
ggccaaaggg 
gtcttcccca 
aaggagaagt 
gatgtgctaa 
caaagagtat 
ctcctaagca 
ctggacttcc 
ctggagaagg 
tgggatgagc 
tgtgcgctgt 
tcttcctaac 
tgtcctcact 
gccaaccatt 
ttcatacatg 
aaaaaa 



tttcttttgt 
gaatatgatg 
atttaaatac 
ttggtccagg 
tggatgctgg 
gcttccatct 
tcaccctgat 
ggctccaagg 
tgatagtgag 
agaaaatcaa 
ggaagctgaa 
atctggtgac 
ataaattttg 
aagtgaccat 
acaaggagac 
tggaggacat 
actccctcgc 
ctgaggtcct 
gcctttttaa 
tggatgccct 
ggacccttcc 
tggccagcag 
atgttgttgt 
tacaaaagcc 
gccatcgcca 
ctttgatgta 
gacttctgat 



ggatcttcag 
gcttagcttg 
ttgaaccaat 
cgcaatgttc 
aggggatatg 
ggtgggggag 
ggacattctg 
tcaaaaggct 
attacccaaa 
gatatcggag 
gagggaacta 
agaaactctg 
gagccagatc 
ccccccaaat 
gatgagaaag 
ggagagtgtc 
taagtgcctc 
gatttccggg 
tgctgctggg 
gctagagctg 
tctgttgaag 
tcctcctgac 
ctctatcctg 
ctttctcccc 
ctaccatcct 
tcccattcgc 
taaaagtggc 



ttgcttcagg 
ggttcagagg 
accctcctcc 
agcgtatttg 
attgccgtta 
aagagaactt 
gacacacatg 
gagtttcaaa 
gaaataacaa 
aaccggatat 
cccttttcat 
gagacggtaa 
tctcagggcc 
cgggtcctga 
tctttgggtt 
ctcaaggacc 
ggcaaggagg 
gagctacaca 
gtcttggtag 
tctgaagagc 
gaccaggtga 
atggactatg 
ctggagctgg 
acaagcctct 
gtcaccagtg 
tccatgttaa 
aggttgtgca 



60 
120 
180 
240 
300 
360 
420 
48fl 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1646 



BHC03 1 051 



-83 



<211> 3340 

<212> DNA 

<213> Homo sapiens 



<400> 74 

cgggcgccca 

tctctcttcc 

ggaggcatgg 

catttcttgg 

aaggatctat 

actctgaggc 

ccacccagca 

cagacccggg 

cactcttatg 

gtgacggaga 

gtagctgtgt 

accgcactgg 

gtgcccctcc 

cctgaggacg 

gagctggagt 

ttggagctgc 

ctctgcgagg 

ctctgcaagc 

gccgctgtca 

tttcgcaaat 

atctccaaaa 

ttcctgcacg 

tcagtggccg 

atggaggcat 

gacaaccaga 

tggaggctcc 

atgtctcaac 

ggtcgatgga 

ttgctccttc 

gaaagctcct 

tcctcttgca 

agggattggc 

ccccaggaat 

tgtgtcttct 

tggggagggg 

gtgctttctc 

ttgtgagcca 

agcccttgca 

ggtaaaagat 

gcagtgggct 

tttttgggat 

gacaccaaca 

agctggggtg 

ggcagagact 

cctctcccat 

gaggtttcct 

cagcaacagc 

cccaaagctc 

tcatatgtga 

cttgttcctg 

gccctgacag 

gcactgtcct 

ccttggaact 

ttggctgctg 

gccctgcagc 

aaccgctggg 



gagacagcgc 

tcccgacgga 

agggccctga 

acacctacca 

acctgctaat 

tgcaccagct 

agcaggtgaa 

tggccttccg 

tgaccatacg 

aacttcaata 

cctcctctgg 

gcatcaacag 

ccgaggagat 

ttgccaacca 

tcgtggacta 

tgctgcagcg 

ccccgggcaa 

agaaccagga 

gccgccttcg 

ttgagaacct 

tgaagccccc 

aagggagtaa 

aaaaagtgag 

cccccaatca 

acctcctctt 

agtcagaccc 

caacatctga 

tttacccctg 

aagccaaaac 

tcctctatgt 

gcaggaaaga 

agccttcttc 

attatgttgc 

tttagggagc 

tccttctgtt 

cacccccaca 

aggtgctttt 

agtcccccac 

gctcagggaa 

cgggtagagc 

atctaactgc 

acatggtctc 

gggtggatcc 

gtaaaggcgc 

caccagcctc 

ggcacagact 

cgtagcaaaa 

tggaattgta 

acattgtatc 

cagcaaaaat 

tgtcccatgg 

ctccctgcag 

ccagctgcca 

cttaccccta 

tttaacagag 

tggagaagaa 



cgcctcagat 

gaaatttctg 

agggctgggc 

ggggctgctt 

tatgaaggac 

ggtggagacg 

gccactcttc 

gggctctgat 

cagccgcctt 

ttcagaggag 

agagaaggtc 

ccacctgttt 

ccaggtctcc 

cctaactgcc 

cgtgttccac 

ctgcagcgag 

gcgcgcgcag 

cctgctgtct 

actcacctgg 

gacggacccc 

tgtgattccc 

gacccttgta 

gacaatccgc 

cctgcagacc 

cgagctctcc 

gccagatcct 

catctttccc 

gacccataag 

tacactttgc 

cctcccatcg 

atgctgctca 

tccaccacct 

cgtgtgtgtg 

aggagtgcat 

tggtgctacc 

ctccctgctc 

tggggcaggg 

tcaggcctct 

acacaggcct 

agtatcagga 

taaggaggga 

cgtccctctc 

ctatactggg 

cactggactc 

aaggcctgag 

cacccttctt 

gcagctgctg 

catttatttt 

ataggtaatg 

cctctatgga 

gcccccttct 

cccctggctt 

ccaccctctg 

gccttgatgc 

tgaaccatgt 

gccgataaag 



atcctgctgg 

caggagctac 

cggaagcaag 

caagaggaag 

gagtcccttt 

gtggaactaa 

cgccacttcc 

gagatcttct 

tcagcatctg 

cccgcggggc 

cttctccagc 

gcctgtactc 

cctggagaca 

ttccactggg 

ggggagcgcg 

gtcacgcact 

ctgctcaaga 

ttctacgccg 

gagaagctgc 

tgcaggaacc 

ttcgtgcctc 

gatggtttgg 

aaataccgga 

aaggcctatg 

tacaagctgg 

tgggcacctg 

gtggagcaac 

tctgttcatc 

tggttcctgt 

agatctgttc 

cccttctgtc 

gtccagcttc 

tgtgtgtgtg 

ctggtaattg 

cttgtctact 

agctcctcgt 

agtagcagca 

ggaaggtcca 

cagctgccta 

gctagggttg 

gttgacatcc 

tcttagactc 

gcaggcagcc 

tggcaaggcc 

gggtgcaggg 

tctggcacca 
ctcctgctat 
ttaaaactca 
ttgtacagac 
cataggaggt 
gctccctgcc 
cccagccttc 
ggtcggacac 
ctgcccaggg 
gtattgtaca 
tctatgaatc 



atgacattgt 

accagtactt 

cctgtctagc 

agggggccgg 

accagggcct 

agattccaga 

gccggataga 

gccgtgtata 

tgcaggacat 

gtgaggattc 

ccactgagga 

gggacagcta 

cagagatcca 

agctgttccg 

gccgccggga 

gggtggccac 

agttcatcaa 

tggtcatggg 

cagggaaatt 

acaaaagcta 

tgatcctcaa 

tgaacatcga 

gccggcccct 

tgcgccagtt 

aggcaaacag 

gcactcaagc 

ttcctgctcc 

ctgctgaagt 

cccctctgag 

tggggatgga 

ttgcagagtg 

ttcctggtca 

tgtgtgtgtg 

agggtggatg 

ctgcccctgg 

gctgccctgc 

ggtgggaggg 

gggatgggct 

gaggaccctc 

tctgctgccc 

cccttctggc 

tccctttgtc 

ccaaagtggg 

tttattacct 

gctcctggca 

ctctttccct 

gagggtgtat 

aagagggaaa 

ccttttatac 

gctgtgtccc 

ccctccctgc 

ctcctgaccc 

tgggacccac 

acccccagcc 

ggcgcggttg 



ccttacccat 

tgttcgggca 

catgcttctc 

ccacatcatc 

ccgagaggac 

ggagaaccag 

ctcctgtctg 

catgcctgac 

tctgggctct 

cctcatcctg 

ctgtgttttc 

tgaggctctg 

ccgagtggag 

atgtgtgcat 

gacggccaac 

cgaagtgctg 

gatcgcggcc 

gctggacaac 

caagaacttg 

ccgagaagtg 

agacctgact 

gaagctgcat 

ttgcctggac 

tcaggtcatc 

tcagtgagag 

actttgcacg 

acgggaaaga 

cccctcccca 

aaaggggata 

gcttccaact 

ggattgtggg 

gggctgggac 

tgtgtgtgtg 

ttgtgtgtgc 

atggtgcggg 

atgcccaggc 

gttacccatc 

ctgatgagag 

cccctgcctt 

acactcctgc 

tcatgtgtct 

ctccccatag 

ggagggggat 

ttactccctc 

gctactgggt 

tttgaagaga 

atatttttta 

gagccttgta 

agtgatctgt 

atgccttctt 

tactgctgat 

cttccaacag 

tggcccagtc 

ccctcccgtt 

tcattgcaga 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3340 
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<210> 75 

<211> 4005 

<212> DNA 

<213> Homo sapiens 



<400> 75 
gggcaacagt 
atctctggga 
cccaccatgt 
gcgccttgtg 
cagtgttctt 
tgcatggggc 
ttcctgccgt 
gacttctcac 
cttcccccag 
aagaaggagg 
aggtgccacg 
cctcgcagtg 
gagagcctca 
gagggtctct 
aaggcctcga 
cggctgggag 
gtgtctgatg 
agccatcagt 
agccttggct 
accctaacca 
agcatctgcc 
ggctgcccag 
caacctggag 
ctgtcagatg 
atgaagttcg 
gcaatcgagc 
taccgaggct 
cgaccaggtg 
ggagtgcatc 
tgccagcatt 
ctgggaggtg 
tctgcgggct 
ctggccgtgc 
gtggtccact 
tttttccggc 
cggaagtggc 
acagagcctc 
tcgcaggtca 
gactgcctgt 
ctgaatgggt 
catcaagttg 
ccgggcctca 
atgctggtgg 
cccctgcatg 
acaacagtct 
ggtgacccac 
gaatggatag 
gtggcttttt 
gcagctcagc 
cttgcagatg 
gagacagagt 
acctccacct 
caggcatgtg 
gttggccagg 
agtgctggga 



ctgcccacct 

tgtcagtgag 

cccaggtgat 

atgagcccag 

actacaccac 

cccctggccg 

cccctggtga 

tggagagcct 

ggactggggg 

aatctgaagc 

attggcccca 

gtggcctcct 

tcttctctgg 

cccctcgacc 

gcccccatgg 

gcctggcccc 

tcagctatat 

catcttccag 

cagtgtccct 

tgggccaacc 

ccccatccat 

aaccagggfcc 

ctgcttctcc 

ccccctttac 

tgatggacac 

tgctgaggaa 

ccttcggcct 

aggacagcaa 

tcaaaggagc 

ccatcatggc 

cagatggggc 

gccacaccct 

agaaagccat 

tcgaagtcac 

gccattaccc 

agaagtactg 

aggagaacgt 

tcggcctggt 

gcacctaacc 

gtggcttgtg 

acacccttga 

gtttcctcaa 

gcatttgaac 

aaaataacct 

gagcaagaga 

ggggctcctg 

actctatttc 

gggtccaccc 

agggggaact 

agtatcaccc 

ctcactgtca 

cccaggttca 

caactcaccc 

ctggtcttga 

ttacaggcgc 



gtggacacca 
gctggttgaa 
gtccagccca 
gaggaccctg 
ggaaggctgg 
actccagcaa 
gaaggccttg 
caatcagatg 
ctcccaggct 
cttggacata 
gcactgctcc 
cctttccaga 
gaaccagggc 
cccaaattcc 
tttgggctcc 
acagcggggc. 
gtttggaagc 
atccttggaa 
gtgtacaaga 
cagaacaccc 
caccaactcc 
ttctccaccc 
cagcaacccc 
cacatgccca 
atctaaatac 
ggaggagcca 
ggccctgaag 
tgacctcatc 
agatgaggag 
cctggccctg 
ctcggactct 
gtacctgagc 
ctccaccacc 
agagcagggc 
actcaccacc 
caaaccctcc 
atgccacctc 
gactgctctg 
aacacctcca 
gccatattga 
acctgctatg 
tcatagaaga 
atgctcctcc 
ccaaggaccc 
cctgcagccc 
gcttgcagct 
cttccatatc 
agccaggatg 
tgtccccatg 
catcttttct 
cccaggctga 
agcaattatc 
agctaatttt 
actcctgacc 
aagccaccca 



gatcctggga 
gaccagaggt 
ctgctggcag 
cacccagcac 
ggagcccagg 
gccccacagg 
gggaccccag 
atcctggaac 
gagctggccc 
aagtacatcg 
agcccctctg 
gacgtccccc 
agggggcacc 
cccagcatct 
ccgctggtgg 
agcaggatct 
agccagtccc 
agtccagcca 
cccagtgact 
cactctccac 
atggtggaca 
cagcggaccc 
tgtccagcca 
gagggtcccg 
tggtttaagc 

ggggcttttg 

gtgcaggagg 
cgacacttcc 
ccctactttg 
ccctgcaaac 
acagacagcc 
tcagtgagcg 
tttgagaggg 
atcactctga 
ctccgcttct 
tggatctttg 
tttgcggagt 
ctgcaggacg 
ggggctcgct 
cagaccaatc 
gccttcagca 
agaccaatag 
atgattctga 
tctgacccca 
ctgtttcgtg 
ggtgatggtc 
tgttcctctg 
ctgcaggcca 
gtcagaggag 
ttccacttgg 
actgcagtgg 
ctgcctcagg 
gtatttttag 
gcaggtaatc 
gcccagcttc 



gctcctggtt 

aaactgcaga 

gaggccatgc 

ccagccccag 

ccctgatggc 

tggaggccaa 

aggaccttga 

tggaccccac 

agagcaccat 

aggtgacctc 

tcaccccgcc 

gagagacacg 

agcgccctct 

caatcccttg 

cttctccaag 

ctgtgctgtc 

tcctgcactc 

actcttcctc 

tccaggctcc 

cactggccaa 

tacccattgt 

caggacacca 

ccaggagcaa 

ccagggacat 

caaacatcac 

tcataaggga 

ttcccgcgtc 

tcatcgagtc 

ggagcctctc 

tcaccatccc 

cagcctcctg 

tggagaccct 

acatcctccc 

ctgatgtcca 

gtggtatgga 

ggtttgtggc 

atgacatggt 

cagaaaggat 

aaggagcccc 

tatgggacta 

gtcaccatca 

acaagatcag 

agcatgcaca 

tcgacctggg 

gcagacagca 

aagaactgac 

ttccttttcc 

agctgggtgt 

acccagctgt 

tttttatttt 

tgtgatctag 

ctcccgagta 

tagagacagg 

cacctgcttc 

tttccattcc 



agcaagtgag 

ggtcaccacc 

tgtcagcttg 

cctgccaccc 

ccccgtgccc 

agccacctgc 

ctcctacatt 

cttccagctg 

gtcaatgaga 

cgccagatca 

cttcggctcc 

aagcagcagt 

gcccccctca 

catggggagc 

actggagaag 

agccagccca 

cagcaactcc 

cagcctccac 

cagaaacccc 

agaacatgcc 

gctgatcaac 

gaactccgtt 

cagccagacc 

gcagcccacc 

ccgagagcaa 

cagctcttca 

tgctcagaat 

gtctgccaaa 

tgccttcgtg 

acagagagaa 

ccagaagaaa 

gactggagcc 

cacgcccacc 

gaggaaggtg 

ccctgagcaa 

caagagccag 

ccagccagcc 

gtaggggaga 

cctccacccc 

gggggattgg 

tccagacccc 

ctgttcttag 

cctctgaaga 

ccctgcccac 

ggtgcctggc 

tacaaaacag 

cactttctgg 

ggtatttagg 

cctgcacccc 

tatttttttt 

gctcactgca 

gctgggatta 

gtttcaccat 

ggcctcccaa 

ttgataggcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660^ 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
204 

210^ 

2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
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a 3 tattccaa agctggtatc gtagctgccc taatgttgca tattaggcgg cgggggcaqa 33G0 

gataagggcc atctctctgt gattctgcct cagctcctgt cttgctgagc cctcccccaa 3420 

cccacgctcc aacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 34 8 0 

cacgcccctc tactgctatg tggcttcaac cagcctcaca gccacacggg ggaagcagag 3540 

agtcaagaat gcaaagaggc cgcttcccta agaggcttgg aggagctggg ctctatccca 3600 

cacccacccc caccccaccc ccacccagcc tccagaagct ggaaccattt ctcccgcagg 3660 

cctgagttcc taaggaaacc accctaccgg ggtggaaggg agggtcaggg aagaaaccca 3720 

ctcttgctct acgaggagca agtgcotgcc ccctcccagc agccagccct gccaaagttg 3730 

cattatcttt ggccaaggct gggcctgacg gttatgattt cagccctggg cctgcaggag 3 840 

aggctgagat cagcccaccc agccagtggt cgagcactgc cccgccgcca aagtctgcag 3900 

aatgtgagat gaggttctca aggtcacagg ccccagtccc agcctggggg ctggcagagg 3 960 

cccccatata ctctgctaca gctcctatca tgaaaaataa aatgt 4005 

<211> 1093 

<212> PRT 

<213> Homo sapiens 



<400> 76 

Met Lys Glu Met Val Gly Gly Cys Cys Val Cys Ser Asp Glu Arg Gly 

1 5 10 15 

Trp Ala Glu Asn Pro Leu Val Tyr Cys Asp Gly His Ala Cys Ser Val 

20 25 30 

Ala Val His Gin Ala Cys Tyr Gly lie Val Gin Val Pro Thr Gly Pro 

35 40 45 

Trp Phe Cys Arg Lys Cys Glu Ser Gin Glu Arg Ala Ala Arg Val Arg 

50 55 60 

Cys Glu Leu Cys Pro His Lys Asp Gly Ala Leu Lys Arg Thr Asp Asn 
65 „ 70 75 80 

Gly Gly Trp Ala His Val Val Cys Ala Leu Tyr lie Pro Glu Val Gin 

85 90 95 

Phe Ala Asn Val Leu Thr Met Glu Pro lie Val Leu Gin Tyr Val Pro 

100 105 no 

His Asp Arg Phe Asn Lys Thr Cys Tyr He Cys Glu Glu Thr Gly Arq 

13 -5 120 125 

Glu Ser Lys Ala Ala Ser Gly Ala Cys Met Thr Cys Asn Arg His Glv 

130 135 140 

Cys Arg Gin Ala Phe His Val Thr Cys Ala Gin Met Ala Gly Leu Leu 
145 150 155 * 160 

Cys Glu Glu Glu Val Leu Glu Val Asp Asn Val Lys Tyr Cys Gly Tyr 

165 170 175 

Cys Lys Tyr His Phe Ser Lys Met Lys Thr Ser Arg His Ser Ser Glv 

18 ° 185 190 

Gly Gly Gly Gly Gly Ala Gly Gly Gly Gly Gly Ser Met Gly Gly Glv 

195 200 205 

Gly Ser Gly Phe He Ser Gly Arg Arg Ser Arg Ser Ala Ser Pro Ser 

210 215 220 

Thr Gin Gin Glu Lys His Pro Thr His His Glu Arg Gly Gin Lys Lys 
225 230 235 * 240 

Ser Arg Lys Asp Lys Glu Arg Leu Lys Gin Lys His Lys Lys Arg Pro 

245 250 255 

Glu Ser Pro Pro Ser He Leu Thr Pro Pro Val Val Pro Thr Ala Asn 

260 265 270 

Lys Val Ser Ser Ser Ala Ser Ser Ser Ser His His Glu Ala Ser Thr 

275 280 285 

Gin Glu Thr Ser Glu Ser Ser Arg Glu Ser Lys Gly Lys Lys Ser Ser 

290 295 300 

Ser His Ser Leu Ser His Lys Gly Lys Lys Leu Ser Ser Gly Lys Gly 
05 310 315 320 

Val Ser Ser Phe Thr Ser Ala Ser Ser Ser Ser Ser Ser Ser Ser Ser 
325 330 335 
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Ser Ser Gly Gly Pro Phe Gin Pro Ala Val Ser Ser Leu Gin Ser Ser 

340 345 350 

Pro Asp Phe Ser Ala Phe Pro Lys Leu Glu Gin Pro Glu Glu Asp Lys 

355 360 365 

Tyr Ser Lys Pro Thr Ala Pro Ala Pro Ser Ala Pro Pro Ser Pro Ser 

370 375 380 

Ala Pro Glu Pro Pro Lys Ala Asp Leu Phe Glu Gin Lys Val Val Phe 
385 390 395 400 

Ser Gly Phe Gly Pro lie Met Arg Phe Ser Thr Thr Thr Ser Ser Ser 

405 410 415 

Gly Arg Ala Arg Ala Pro Ser Pro Gly Asp Tyr Lys Ser Pro His Val 

420 425 430 

Thr Gly Ser Gly Ala Ser Ala Gly Thr His Lys Arg Met Pro Ala Leu 

435 * 440 445 

Ser Ala Thr Pro Val Pro Ala Asp Glu Thr Pro Glu Thr Gly Leu Lys 

450 455 460 

Glu Lys Lys His Lys Ala Ser Lys Arg Ser Arg His Gly Pro Gly Arg 
465 470 475 480 

Pro Lys Gly Ser Arg Asn Lys Glu Gly Thr Gly Gly Pro Ala Ala Pro 

485 490 495 

Ser Leu Pro Ser Ala Gin Leu Ala Gly Phe Thr Ala Thr Ala Ala Ser 

500 505 510 

Pro Phe Ser Gly Gly Ser Leu Val Ser Ser Gly Leu Gly Gly Leu Ser 

515 520 525 

Ser Arg Thr Phe Gly Pro Ser Gly Ser Leu Pro Ser Leu Ser Leu Glu 

530 535 540 

Ser Pro Leu Leu Gly Ala Gly lie Tyr Thr Ser . Asn Lys Asp Pro. lie 
545 550 555 560 

Ser His Ser Gly Gly Met Leu Arg Ala Val Cys Ser Thr Pro Leu Ser 

565 570 575 

Ser Ser Leu Leu Gly Pro Pro Gly Thr Ser Ala Leu Pro Arg Leu Ser 

580 ^ 585 590 

Arg Ser Pro Phe Thr Ser Thr Leu Pro Ser Ser Ser Ala Ser lie Ser 

595 600 605 

Thr Thr Gin Val Phe Ser Leu Ala Gly Ser Thr Phe Ser Leu Pro Ser 

610 615 620 

Thr His lie Phe Gly Thr Pro Met Gly Ala Val Asn Pro Leu Leu Ser 
625 630 635 640 

Gin Ala Glu Ser Ser His Thr Glu Pro Asp Leu Glu Asp Cys Ser Phe 

645 650 655 

Arg Cys Arg Gly Thr Ser Pro Gin Glu Ser Leu Ser Ser Met Ser Pro 

660 665 670 

He Ser Ser Leu Pro Ala Leu Phe Asp Gin Thr Ala Ser Ala Pro Cys 

675 680 685 

Gly Gly Gly Gin Leu Asp Pro Ala Ala Pro Gly Thr Thr Asn Met Glu 

690 " 695 700 

Gin Leu Leu Glu Lys Gin Gly Asp Gly Glu Ala Gly Val Asn He Val 
705 710 715 720 

Glu Met Leu Lys Ala Leu His Ala Leu Gin Lys Glu Asn Gin Arg Leu 

725 730 735 

Gin Glu Gin lie Leu Ser Leu Thr Ala Lys Lys Glu Arg Leu Gin lie 

740 745 750 

Leu Asn Val Gin Leu Ser Val Pro Phe Pro Ala Leu Pro Ala Ala Leu 

755 760 765 

Pro Ala Ala Asn Gly Pro Val Pro Gly Pro Tyr Gly Leu Pro Pro Gin 

770 775 780 

Ala Gly Ser Ser Asp Ser Leu Ser Thr Ser Lys Ser Pro Pro Gly Lys 
785 ~ 790 795 800 

Ser Ser Leu Gly Leu Asp Asn Ser Leu Ser Thr Ser Ser Glu Asp Pro 

805 810 815 

His Ser Gly Cys Pro Ser Arg Ser Ser Ser Ser Leu Ser Phe His Ser 

820 825 830 

Thr Pro Pro Pro Leu Pro Leu Leu Gin Gin Ser Pro Ala Thr Leu Pro 

835 840 845 

Leu Ala Leu Pro Gly Ala Pro Ala Pro Leu Pro Pro Gin Pro Gin Asn 
850 855 860 
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Gly Leu Gly 


Arg 


Ala 


Pro 


Gly Ala 


Ala 


Gly 


Leu 


Gly 


Ala 


Met 


Pro 


Met 


865 






870 








875 










880 


Ala Glu Gly 


Leu 


Leu 


Gly 


Gly Leu 


Ala 


Gly 


Ser 


Gly 


Gly 


Leu 


Pro 


Leu 




885 








890 










895 




Asn Gly Leu 


Leu 


Gly 


Gly 


Leu Asn 


Gly 


Ala 


Ala 


Ala 


Pro 


Asn 


Pro 


Ala 


900 








905 










910 






Ser Leu Ser 


Gin 


Ala 


Gly 


Gly Ala 


Pro 


Thr 


Leu 


Gin 


Leu 


Pro 


Gly 


Cys 


915 








920 










925 




Gin 


Glu 


Leu Asn Ser 


Leu 


Thr 


Glu 


Gin Gin 


Arg 


His 


Leu 


Leu 


Gin 


Gin 


930 








935 








940 










Gin Gin Leu 


Gin 


Gin 


Leu 


Gin Gin 


Leu 


Leu 


Ala 


Ser 


Pro 


Gin 


Leu 


Thr 


945 






950 








955 










960 


Pro Glu His 


Gin 


Thr 


Val 


Val Tyr 


Gin 


Met 


He 


Gin 


Gin 


He 


Gin 


Gin 






965 






970 










975 




Lys Arg Glu 


Leu 


Gin 


Arg 


Leu Gin 


Met 


Ala 


Gly 


Gly 


Ser 


Gin 


Leu 


Pro 


980 








985 










990 






Met Ala Ser 


Leu 


Leu 


Ala 


Gly Ser 


Ser Thr Pro Leu Leu Ser Ala Gly 



995 1000 1005 



Thr Pro 


Gly Leu Leu 


Pro 


Thr 


Ala 


Ser 


Ala 


Pro 


Pro 


Leu 


Leu 


Pro 


1010 




1015 










1020 








Ala Gly 


Ala Leu Val 


Ala 


Pro 


Ser 


Leu Gly Asn 


Asn 


Thr 


Ser 


Leu 


1025 






1030 










1035 








Met Ala 


Ala Ala Ala 


Ala 


Ala 


Ala 


Ala 


Val 


Ala 


Ala 


Ala Gly Gly 


1040 






1045 










1050 








Pro Pro 


Val Leu Thr 


Ala 


Gin 


Thr 


Asn 


Pro 


Phe 


Leu 


Ser 


Leu 


Ser 


1055 






1060 










1065 








Gly Ala 


Glu Gly Ser Gly Gly 


Gly 


Pro 


Lys 


Gly 


Gly 


Thr 


Ala 


Asp 


1070 






1075 










1080 








Lys Gly 


Ala Ser Ala 


Asn 


Gin 


Glu 


Lys 


Gly 












1085 






1090 



















<210> 77 

<211> 344 

<212> PRT 

<213> Homo sapiens 



<400> 77 



Met 


His 


Arg 


Thr 


Thr 


Arg 


He 


Lys 


1 








5 








Met 


Cys 


Ala 


Leu 
20 


Cys 


Gly 


Gly 


Tyr 


Glu 


Cys 


Leu 


His 


Ser 


Phe 


Cys 


Lys 




35 










40 


Thr 


Asn 
50 


Lys 


Tyr 


Cys 


Pro 


Met 
55 


Cys 


Pro 


Leu 


Leu 


Ser 


He 


Arg 


Ser 


Asp 


65 










70 






Lys 


Leu 


Val 


Pro 


Gly Leu 


Phe 


Lys 








85 








Phe 


Tyr 


Ala 


Ala 
100 


Tyr 


Pro 


Leu 


Thr 


Asp Arg Gly Glu 


Val 


Leu 


Glu 


Gin 






115 










120 


Glu 


He 
130 


Val 


Ser 


Leu 


Ser 


He 
135 


Glu 


Asp 


Glu 


Lys 


Lys 


Gly Pro 


Leu 


Glu 


145 










150 






Thr Gly Val Arg 


Phe 


Leu 


Arg 


Cys 










165 








Leu 


Ala 


Lys 


Phe 


Leu 


Arg 


Asn 


Lys 



180 



He 


Thr 
10 


Glu 


Leu 


Asn 


Pro 


His 
15 


Leu 


Phe 


He 


Asp 


Ala 


Thr 


Thr 


He 


Val 


25 










30 






Thr 


Cys 


He 


Val 


Arg 
45 


Tyr 


Leu 


Glu 


Asp 


Val 


Gin 


Val 
60 


His 


Lys 


Thr 


Arg 


Lys 


Thr 


Leu 


Gin 


Asp 


He 


Val 


Tyr 




75 










80 


Asp 


Glu 
90 


Met 


Lys 


Arg 


Arg 


Arg 
95 


Asp 


Glu 


Val 


Pro 


Asn 


Gly 


Ser 


Asn 


Glu 


105 










110 






Glu 


Lys 


Gly 


Ala 


Leu 
125 


Ser 


Asp 


Asp 


Phe 


Tyr 


Glu 


Gly 
140 


Ala 


Arg 


Asp 


Arg 


Asn 


Gly 


Asp 
155 


Gly 


Asp 


Lys 


Glu 


Lys 
160 


Pro 


Ala 
170 


Ala 


Met 


Thr 


Val 


Met 
175 


His 


Met 


Asp 


Val 


Pro 


Ser 


Lys 


Tyr 


Lys 



185 190 
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Val 


torJLU 


val Leu 


Tyr 








fie u 


ASp 


lie Ala 


Tyr 








lieu 


Lys 


Tyr Arg 


Val 


o o c 






vax 


Pro 


Thr Pro 


Ser 








245 


Glu 


Ser 


Val Ser 


Asp 






260 


Sear 


Ser 


Ser Leu 


Pro 






275 




Ser 


HIS 


Gly Pro 


Pro 




290 






Thr 


Ala 


Ser Gly 


Ala 


305 






Leu 


Gin 


Thr Pro 


Ser 








325 


Gly 


Ala 


Pro Val 


Pro 






340 





<210> 78 

<211> 416 

<212> PRT 

<213> Homo sapiens 



-88- 

Glu Asp Glu Pro Leu Lys 
200 

lie Tyr Pro Trp Arg Arg 
215 

Gin Pro Ala Cys Lys Arg 
230 235 
Glu Gly Thr Asn Thr Ser 
250 

Lys Ala Pro Ser Pro Ala 
265 

Ser Pro Ala Thr Pro Ser 
280 

Ala Thr His Pro Thr Ser 
295 

Thr Thr Ala Ala Asn Gly 
310 ... 315 

Ser Thr Ser Arg Gly Arg 
330 

Pro Leu Thr 



Glu Tyr Tyr Thr Leu 
205 

Asn Gly Pro Leu Pro 
220 

Leu Thr Leu Ala Thr 
240 

Gly Ala Ser Glu Cys 
255 

Thr Leu Pro Ala Thr 
270 

His Gly Ser Pro Ser 
285 

Pro Thr Pro Pro Ser 
300 

Gly Ser Leu Asn Cys 
320 

Lys Met Thr Val Asn 
335 



<400> 78 

Met Ser Ser Asn Cys Thr Ser Thr Thr Ala Val Ala Val Ala Pro Leu 

1 5 io 15 

Ser Ala Ser Lys Thr Lys Thr Lys Lys Lys His Phe Val Cys Gin Lys 

20 25 3 £ 

Val Lys Leu Phe Arg Ala Ser Glu Pro lie Leu Ser Val Leu Met Trp 

35 40 45 

Gly Val Asn His Thr He Asn Glu Leu Ser Asn Val Pro Val Pro Val 

50 55 go 

Met Leu Met Pro Asp Asp Phe Lys Ala Tyr Ser Lys He Lys Val Asp 

Z S . 70 7 5 " 80 

Asn His Leu Phe Asn Lys Glu Asn Leu Pro Ser Arg Phe Lys Phe Lys 

85 90 ^ 95 

Glu Tyr Cys Pro Met Val Phe Arg Asn Leu Arg Glu Arg Phe Gly He 

100 105 ~ no 

Asp Asp Gin Asp Tyr Gin Asn Ser Val Thr Arg Ser Ala Pro He Asn 

115 120 125 

Ser Asp Ser Gin Gly Arg Cys Gly Thr Arg Phe Leu Thr Thr Tyr Asp 

130 135 14Q 

Arg Arg Phe Val He Lys Thr Val Ser Ser Glu Asp Val Ala Glu Met 
145 150 155 160 

His Asn He Leu Lys Lys Tyr His Gin Phe He Val Glu Cys His Gly 

165 170 175 

Asn Thr Leu Leu Pro Gin Phe Leu Gly Met Tyr Arg Leu Thr Val Asp 

180 185 ^ 190 

Gly Val Glu Thr Tyr Met Val Val Thr Arg Asn Val Phe Ser His Arg 

"5 200 205 

Leu Thr Val His Arg Lys Tyr Asp Leu Lys Gly Ser Thr Val Ala Ara 

210 215 220 

Glu Ala Ser Asp Lys Glu Lys Ala Lys Asp Leu Pro Thr Phe Lvs Asp 
I 25 230 235 y 240 

Asn Asp Phe Leu Asn Glu Gly Gin Lys Leu His Val Gly Glu Glu Ser 

245 250 255 

Lys Lys Asn Phe Leu Glu Lys Leu Lys Arg Asp Val Glu Phe Leu Ala 
260 265 270 
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-89- 



Gin 


Leu 


Lys 


lie 


Met 


Asp 


Tyr 


Ser 


Leu 


Leu 


Val 


Gly 


He 


His 


Asp 


Val 






275 










280 








285 






Asp 


Arg 


Ala 


Glu 


Gin 


Glu 


Glu 


Met 


Glu 


Val 


Glu 


Glu 


Arg 


Ala 


Glu 


Asp 




290 










295 










300 








Glu 


Glu 


Cys 


Glu 


Asn 


Asp 


Gly 


Val 


Gly 


Gly 


Asn 


Leu 


Leu 


Cys 


Ser 


Tyr 


305 










310 










315 








320 


Gly 


Thr 


Pro 


Pro 


Asp 


Ser 


Pro 


Gly 


Asn 


Leu 


Leu 


Ser 


Phe 


Pro 


Arg 


Phe 










325 










330 










335 




Phe 


Gly 


Pro 


Gly Glu 


Phe 


Asp 


Pro 


Ser 


Val 


Asp 


Val 


Tyr Ala 


Met 


Lys 








340 










345 










350 




Ser 


His 


Glu 


Ser 


Ser 


Pro 


Lys 


Lys 


Glu 


Val 


Tyr 


Phe 


Met 


Ala 


He 


He 






355 










■J oU 


















Asp 


lie 


Leu 


Thr 


Pro 


Tyr 


Asp 


Thr 


Lys 


Lys 


Lys 


Ala 


Ala 


His 


Ala 


Ala 




370 










one 
37b 










380 










Lys 


Thr 


Val 


Lys 


His 


Gly 


Ala 


Gly 


Ala 


Glu 


He 


Ser 


Thr 


Val 


Asn 


Pro 


385 










"3 Q f\ 










395 










400 


Glu 


Gin 


Tyr 


Ser 


Lys 


Arg 


Phe 


Asn 


Glu 


Phe 


Met 


Ser 


Asn 


He 


Leu 


Thr 










405 










410 










415 




<210> 79 




























<211> 500 




























<212> PRT 




























<213> Homo 


sapiens 
























<400> 79 




























Met 


Arg 


Gly 


Glu 


Leu 


Trp 


Leu 


Leu 


Val 


Leu 


Val 


Leu 


Arg 


Glu 


Ala 


Ala 


1 








5 










10 








15 




Arg 


Ala 


Leu 


Ser 


Pro 


Gin 


Pro 


Gly 


Ala 


Gly 


His 


Asp 


Glu 


Gly 


Pro 


Gly 








20 










25 










30 




Ser 


Gly 


Trp 


Ala 


Ala 


Lys 


Gly 


Thr 


Val 


Arg 


Gly 


Trp 


Asn Arg 


Arg 


Ala 






35 










40 










45 






Arg 


Glu 


Ser 


Pro 


Gly 


His 


Val 


Ser 


Glu 


Pro 


Asp 


Arg 


Thr Gin 


Leu 


Ser 




50 










55 








60 










Gin 


Asp 


Leu 


Gly 


Gly 


Gly 


Thr 


Leu 


Ala 


Met 


Asp 


Thr 


Leu 


Pro 


Asp 


Asn 


65 










70 










75 








80 


Arg 


Thr 


Arg 


Val 


Val 


Glu 


Asp 


Asn 


His 


Ser 


Tyr 


Tyr 


Val 


Ser 


Arg 


Leu 










85 










90 










95 




Tyr 


Gly 


Pro 


Ser 


Glu 


Pro 


His 


Ser 


Arg 


Glu 


Leu 


Trp 


Val 


Asp 


Val 


Ala 








100 










105 








110 






Glu 


Ala 


Asn 


Arg 


Ser 


Gin 


Val 


Lys 


He 


His 


Thr 


He 


Leu 


Ser 


Asn 


Thr 






115 










120 










125 








His 


Arg 


Gin 


Ala 


Ser 


Arg 


Val 


Val 


Leu 


Ser 


Phe 


Asp 


Phe 


Pro 


Phe 


Tyr 




130 










135 










140 








Gly 


His 


Pro 


Leu 


Arg 


Gin 


He 


Thr 


He 


Ala 


Thr 


Gly 


Gly 


Phe 


He 


Phe 


145 










150 










155 






160 


Met 


Gly 


Asp 


Val 


He 


His 


Arg 


Met 


Leu 


Thr 


Ala 


Thr 


Gin 


Tyr 


Val 


Ala 










165 










170 








175 




Pro 


Leu 


Met 


Ala 


Asn 


Phe 


Asn 


Pro 


Gly 


Tyr 


Ser 


Asp 


Asn 


Ser 


Thr 


Val 








180 










185 








190 






Val 


Tyr 


Phe 


Asp 


Asn 


Gly 


Thr 


Val 


Phe 


Val 


Val 


Gin 


Trp 


Asp 


His 


Val 






195 










200 










205 






Tyr 


Leu 


Gin 


Gly 


Trp 


Glu 


Asp 


Lys 


Gly 


Ser 


Phe 


Thr 


Phe 


Gin 


Ala 


Ala 




210 










215 










220 










Leu 


His 


His 


Asp 


Gly 


Arg 


He 


Val 


Phe 


Ala 


Tyr 


Lys 


Glu 


He 


Pro 


Met 


225 










230 










235 








240 


Ser 


Val 


Pro 


Glu 


He 


Ser 


Ser 


Ser 


Gin 


His 


Pro 


Val 


Lys 


Thr 


Gly 


Leu 










245 










250 








255 




Ser 


Asp 


Ala 


Phe 


Met 


He 


Leu 


Asn 


Pro 


Ser 


Pro 


Asp 


Val 


Pro 


Glu 


Ser 








260 










265 








270 






Arg 


Arg 


Arg 


Ser 


He 


Phe 


Glu 


Tyr 


His 


Arg 


He 


Glu 


Leu 


Asp 


Pro 


Ser 






275 










280 










285 
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Lys 


Val 


Thr 


Ser 


Met 


Ser 


Ala 


Val 




290 














Leu 


Gin 


His 


ftA 3 


Ser 


Cys 


Asp 


Ala 


J U J 










•lift 

J Xv 






Aon 


Owes 
v— Jy o 






jr S3 


His 
nxo 


Val 

V CLX 


XJC LX 










10 5 








.nxy 


Tvr 


AT*CJ 


Gin 


Glu 


Trn 


Met 


Asp 








•J4Q 
-J «* w 










v»iy 


Arg 


nc l» 


v*ys 


m ii 

VJXU 


Ron 


XT XXG 


Gl n 

UXU 






J DO 














nop 




Car 
uCi. 


Php 


OCX 


JTX U 


xyr 




■> / V/ 














Ser 




XJC Li, 


Phe 


He 

X -X C 


Aon 


OCX 


XJC LX 


-» O -J 










•jqn 








XT X V 


Ayr 


AT a 


OX Jr 


VJiy 


7\ en 


m v 

uxy 










A O C 








X 11 X 


iiyb 


gi \y 


TVl Y* 

XXI XV 


rXO 


vol 


XT -I e 


Leu 








>1 n n 
4 -<s U 










a 1 a 
Aid 


vai 


Leu 


Leu 


Vdl 


Aid 


Aid 


IIS 






4 1 cr 










A A A 


Gly 


His 


Pro 


Thr 


Ser 


Asn 


Ala 


Ala 




450 










455 




His 


His 


Trp 


Pro 


Ala 


Met 


Lys 


Phe 


465 










470 






Tyr 


Ala 


Glu 


Val 


Glu 


Pro 


Ser 


Gly 










485 








Ala 


Glu 


Gin 


Cys 
















500 










<210> 80 












<211> 509 












<212> PRT 












<213> Homo 


sapiens 








<400> 80 












Met 


Glu 


Asp 


He 


Gin 


*"PV» v* 

xiir 


Asn 


Ala 


1 








5 








Ser 


Val 


Pro 


Ala 


Glu 


oer 


Ala 


Aid 








20 










Lys 


Ser 


His 


Glu 


Met 


w_L Li 


a en 
AS XX 


\7a 1 
Veil 






35 










a n 
4 u 


Glu 


Asp 


Glu 


Asp 


Xle 


Gl \y 


ASp 


a en* 




50 














Ser 


Glu 


Arg 


Asp 


Glu 


Asn 


Veil 


Leu 


65 
















Glu 


Glu 


Pro 


Glu 


He 


XT X W 


XyiLT 


OCX 










85 








Glu 


As 11 


lie 


Lys 


Leu 


Glu 




His 








100 










Pro 


Thr 


Ser 


Gly 


Lys 


Met 


Asn 


v-y » 






115 










ion 

1 A W 


Ser 


Phe 


Asn 


Val 


Leu 


ric L- 


V dX 


nxp 




130 










X J J 




Pro 


Phe 


Gin 


Cys 


Asn 


Gin 


Cys 


Gly 


145 










150 






Leu 


Leu 


Arg 


His 


lie 


Lys 


Leu 


His 










165 








His 


Leu 


Cys 


Asn 


Tyr 


Ala 


Cys 


Gin 








180 










Leu 


Arg 


Thr 


His 


Ser 


Val 


Glu 


Lys 




195 










200 



-90- 



Glu 


Phe 


Thr 


Pro 


Leu 


Pro 


Thr 


Cys 








300 










Cys 


Met 


Ser 


Ser 


Asp 


Leu 


Thr 


Phe 




31 5 










320 


Gin Arg 


Cys 


Ser 


Ser 


Glv 


Phe 


Asd 




330 










335 




Tyr Gly 


Cys 


Ala 


Gin 


Glu 


Ala 


Glu 


345 










350 






Asp 


Glu 


Asp 


His 


Asp 


Ser 


Ala 


Ser 










365 








Asp Gly 


Asd 


Leu 


Thr 


Thr 


Thr 


Ser 








380 










Thr 


Thr 


Glu 


As ID 




Thr 


Lvs 


Leu 






395 










400 


Leu 


Gin 


Asn 


Asn 


Leu 


Ser 


Pro 


Lys 




410 










41 5 

*x X «J 




Gly Thr 


He 


Val 


Glv 
v*xy 


He 


Val 


Leu 


425 










430 






lie 


Leu 


Ala 


Gly 


He 


Tyr 


He 


Asn 










445 








Leu 


Phe 


Phe 


He 


Glu 


Arg 


Arg 


Pro 








460 










Arg 


Ser 


His 


Pro 


Asp 


His 


Ser 


Thr 






475 










480 


His 


Glu 


Lys 


Glu 


Gly 


Phe 


Met 


Glu 




490 

■ • 










495 




Glu 


Leu 


xj y o 


R » T~ 
OCX 


XXIX 


Gin 

L7X11 


Glu 

LJXU 


Gl n 

w XXX 




10 










1 5 
X3 




Val 


Leu 


Asn 


Asp 


Tvr 
xyx 


Ser 


Leu 


Thr 


25 
















Asp 


Ser 




wX LX 


Glv 
Oiy 


IT X \J 


Ala 

iUCl 


asn 
nou 










A 5 








Ser 


Met 


Lys 


Val 


Lys 


Asp 


Glu 


Tvr 

xyx 








fin 










Lys 


Ser 


Glu 


Pro 


Met 


Glv 


Asn 


Ala 






75 










80 


Tyr 


Ser 




Glu 


Tvr 
j 


Asn 


Glu 


Tvr 

xy x 




90 










95 




Val 


Val 


Ser 


Phe 


ASD 


Ser 


Ser 


Arcr 


105 










110 






Asp 


Val 


Cys 


Glv 


Leu 


Ser 


Cys 


He 










125 

JL -J 








Lys 


Arg 


Ser 


His 


Thr 


Gly 


Glu 


Arcr 








140 

X ** \J 










Ala 


Ser 


Phe 


Thr 


Gin 


Lys 


Gly 


Asn 






155 










160 


Thr Gly 


Glu 


Lys 


Pro 


Phe 


Lys 


Cys 




170 










175 




Arg Arg 


Asp 


Ala 


Leu 


Thr 


Gly 


His 


185 










190 






Pro 


Tyr 


Lys 


Cys 


Glu 


Phe 


Cys 


Gly 










205 
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Arg Ser Tyr Lys Gin Arg Ser Ser Leu Glu Glu His Lys Glu Arg Cys 

210 215 220 

Arg Thr Phe Leu Gin Ser Thr Asp Pro Gly Asp Thr Ala Ser Ala Glu 
225 230 235 240 

Ala Arg His lie Lys Ala Glu Met Gly Ser Glu Arg Ala Leu Val Leu 

245 250 " 255 

Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser Met Pro Gin 

260 265 270 

Lys Phe lie Gly Glu Lys Arg His Cys Phe Asp Val Asn Tyr Asn Ser 

275 280 285 

Ser Tyr Met Tyr Glu Lys Glu Ser Glu Leu lie Gin Thr Arg Met Met 

290 295 300 

Asp Gin Ala lie Asn Asn Ala He Ser Tyr Leu Gly Ala Glu Ala Leu 
3 °5 310 315 320 

Cys Pro Leu Val Gin Thr Pro Pro Ala Pro Thr Ser Glu Met Val Pro 

325 330 335 

Val He Ser Ser Met Tyr Pro He Ala Leu Thr Arg Ala Glu Met Ser 

340 345 350 

Asn Gly Ala Pro Gin Glu Leu Glu Arg Lys Ser He Leu Leu Pro Glu 

355 360 ~ 365 

Lys Ser Val Pro Ser Glu Arg Gly Leu Ser Pro Asn Asn Ser Gly His 

370 375 380 

Asp Ser Thr Asp Thr Asp Ser Asn His Glu Glu Arg Gin Asn His He 
385 390 395 400 

Tyr Gin Gin Asn His Met Val Leu Ser Arg Ala Arg Asn Gly Met Pro 

405 410 ~ 415 

Leu Leu Lys Glu Val Pro Arg Ser Tyr Glu Leu Leu Lys Pro Pro Pro 

420 425 430 

He Cys Pro Arg Asp Ser Val Lys Val He Asp Lys Glu Gly Glu Val 

435 440 445 

Met Asp Val Tyr Arg Cys Asp His Cys Arg Val Leu Phe Leu Asp Tyr 

450 455 460 

Val Met Phe Thr He His Met Gly Cys His Gly Phe Arg Asp Pro Phe 
465 470 475 " 480 

Glu Cys Asn Met Cys Gly Asp Arg Ser His Asp Arg Tyr Glu Phe Ser 

485 490 - 495 

Ser His He Ala Arg Gly Glu His Arg Ser Leu Leu Lys 
500 505 

<210> 81 

<211> 440 

<212> PRT 

<213> Homo sapiens 



<400> 81 






Met 
1 


Pro 


He 


Pro 


Pro Pro Pro Pro 


Pro 


Thr 


Phe 


His 


5 

Gin Ala Asn Thr 








20 




Glu 


Gin 


Arg 


Gly 


Arg Gly Ala Leu 






35 




40 


Lys 


Leu 


Lys 


Lys 


Val Thr Asn He 




50 






55 


Glu 


Lys 


Pro 


Lys 


Gly Ser Ser Gly 


65 








70 


Leu 


Gin 


Pro 


Lys 


Gly Gly Leu Phe 










85 


Pro 


val 


Gly Ala 


Lys Asp Gly Ser 








100 




Leu 


Gin 


He 


Pro 


Ser Ser Arg Ala 






115 




120 



Pro Pro Pro Gly Pro Pro Pro Pro 

10 15 
Glu Gin Pro Lys Leu Ser Arg Asp 
25 30 
Leu Gin Asp He Cys Lys Gly Thr 
45 

Asn Asp Arg Ser Ala Pro He Leu 
60 

Gly Tyr Gly Ser Gly Gly Ala Ala 

75 80 
Gin Gly Gly Val Leu Lys Leu Arg 

90 95 
Glu Asn Leu Ala Gly Lys Pro Ala 
105 110 
Ala Ala Pro Arg Pro Pro Val Ser 
125 
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Ala Ala Ser Gly Arg Pro Gin Asp Asp Thr Asp Ser Ser Arg Ala Ser 

130 135 " 140 

Leu Pro Glu Leu Pro Arg Met Gin Arg Pro Ser Leu Pro Asp Leu Ser 
145 150 155 160 

Arg Pro Asn Thr Thr Ser Ser Thr Gly Met Lys His Ser Ser Ser Ala 

165 170 175 

Pro Pro Pro Pro Pro Pro Gly Arg Arg Ala Asn Ala Pro Pro Thr Pro 

180 185 190 

Leu Pro Met His Ser Ser Lys Ala Pro Ala Tyr Asn Arg Glu Lys Pro 

195 200 205 

Leu Pro Pro Thr Pro Gly Gin Arg Leu His Pro Gly Arg Glu Gly Pro 

210 215 220 

Pro Ala Pro Pro Pro Val Lys Pro Pro Pro Ser Pro Val Asn lie Arg 
225 230 235 240 

Thr Gly Pro Ser Gly Gin Ser Leu Ala Pro Pro Pro Pro Pro Tyr Arg 

245 250 255 

Gin Pro Pro Gly Val Pro Asn Gly Pro Ser Ser Pro Thr Asn Glu Ser 

260 265 270 

Ala Pro Glu Leu Pro Gin Arg His Asn Ser Leu His Arg Lys Thr Pro 

275 280 285 

Gly Pro Val Arg Gly Leu Ala Pro Pro Pro Pro Thr Ser Ala Ser Pro 

290 295 300 

Ser Leu Leu Ser Asn Arg Pro Pro Pro Pro Ala Arg Asp Pro Pro Ser 
305 310 315 320 

Arg Gly Ala Ala Pro Pro Pro Pro Pro Pro Val lie Arg Asn Gly Ala 

325 330 335 

Arg Asp Ala Pro Pro Pro Pro Pro Pro Tyr Arg Met His Gly Ser Glu 

340 345 350 

Pro Pro Ser Arg Gly Lys Pro Pro Pro Pro Pro Ser Arg Thr Pro Ala 

355 360 365 

Gly Pro Pro Pro Pro Pro Pro Pro Pro Leu Arg Asn Gly His Arg Asp 

370 375 380 

Ser lie Thr Thr Val Arg Ser Phe Leu Asp Asp Phe Glu Ser Lys Tyr 
385 390 395 400 

Ser Phe His Pro Val Glu Asp Phe Pro Ala Pro Glu Glu Tyr Lys His 

405 410 415 

Phe Gin Arg He Tyr Pro Ser Lys Thr Asn Arg Ala Ala Arg Gly Ala 

420 425 430 

Pro Pro Leu Pro Pro He Leu Arg 
435 440 
<210> 82 

<211> 205 

<212> PRT 

<213> Homo sapiens 



<400> 82 






Met 


Ser 


He 


Met 


Ser Tyr Asn Gly 


1 








5 


Lys 


Asn 


Cys 


Val 


Ala He Ala Ala 






20 




Gin 


Met 


Val 


Thr 


Thr Asp Phe Gin 






35 




40 


Leu 


Tyr 


He Gly 


Leu Ala Gly Leu 




50 






55 


Gin 


Arg 


Leu 


Lys 


Phe Arg Leu Asn 


65 








70 


Gin 


He 


Lys 


Pro 


Tyr Thr Leu Met 








85 


Glu 


Lys 


Arg 


Phe 


Gly Pro Tyr Tyr 








100 





Gly Ala Val Met Ala Met Lys Gly 

10 15 
Asp Arg Arg Phe Gly He Gin Ala 
25 30 
Lys He Phe Pro Met Gly Asp Arg 
45 

Ala Thr Asp Val Gin Thr Val Ala 
60 

Leu Tyr Glu Leu Lys Glu Gly Arg 

75 80 
Ser Met Val Ala Asn Leu Leu Tyr 

90 95 
Thr Glu Pro Val He Ala Gly Leu 
105 110 
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Asp Pro Lys Thr Phe Lys Pro Phe He Cys Ser Leu Asp Leu He Gly 

115 120 125 

Cys Pro Met Val Thr Asp Asp Phe Val Val Ser Gly Thr Cys Ala Glu 

130 135 140 

Gin Met Tyr Gly Met Cys Glu Ser Leu Trp Glu Pro Asn Met Asp Pro 
145 150 155 160 

Asp His Leu Phe Glu Thr He Ser Gin Ala Met Leu Asn Ala Val Asp 

1^5 170 175 

Arg Asp Ala Val Ser Gly Met Gly Val He Val His He He Glu Lys 

180 185 190 

Asp Lys He Thr Thr Arg Thr Leu Lys Ala Arg Met Asp 
195 200 205 

<210> 83 

<211> 190 

<212> PRT 

<213> Homo sapiens 



<400> 83 

Leu Thr Arg Ser Cys Ser Thr Cys Cys Pro Ala Val Ala Cys Leu Val 

1 5 io 15 

Gly Arg Gly Val Val Thr Ser Gly Ala Met His Gin Cys Trp Gly Glu 

20 25 30 

Glu Met Leu Gin Gly Met Leu Leu Trp Gly Trp Ala Thr Cys Pro Leu 

35 40 45 

Ser Asn Pro Gly Arg Trp Gly Arg Thr Val Gly Leu Gin His Pro Ala 

50 55 60 

Val Val Ser Ala Phe Arg Ala Leu Leu Leu Leu Met Leu Thr Val His 

f/\ 70 75 80 

Val Ser Tyr Leu Ser Leu He Arg Phe Asp Tyr Gly Tyr Asn Leu Val 

85 90 95 

Ala Asn Val Ala He Gly Leu Val Asn Val Val Trp Trp Leu Ala Trp 

100 105 no 

Cys Leu Trp Asn Gin Arg Arg Leu Pro His Val Arg Lys Cys Val Val 

115 120 125 

Val Val Leu Leu Leu Gin Gly Leu Ser Leu Leu Glu Leu Leu Asp Phe 

130 135 140 

Pro Pro Leu Phe Trp Val Leu Asp Ala His Ala He Trp His He Ser 
145 150 155 160 

Thr He Pro Val His Val Leu Phe Phe Ser Phe Leu Glu Asp Asp Ser 

165 170 175 

Leu Tyr Leu Leu Lys Glu Ser Glu Asp Lys Phe Lys Leu Asp 
180 185 " 190 

<210> 84 

<211> 368 

<212> PRT 

<213> Homo sapiens 



<400> 84 

Ala Pro Pro Pro 

1 

Ala Arg Leu Val 
20 

Gin Gly Asp Arg 
35 

Glu Gin Asn Cys 
50 



Ala Ala Ser Gin 
5 

Leu Leu Ala Gly 

Glu Pro Val Tyr 
40 

Ser Gly Gly Ala 
55 



Gly Glu Arg Met 
10 

Ala Ala Ala Leu 
25 

Arg Asp Cys Val 

Leu Asn His Phe 
60 



Ala Gly Leu Ala 
15 

Ala Ser Gly Ser 
30 

Leu Gin Cys Glu 
45 

Arg Ser Arg Gin 
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Pro lie Tyr Met 


Ser Leu Ala Gly Trp Thr Cys 


Arg Asp Asp 


Cys 


Lys 


65 






70 


75 








80 


Tyr Glu Cys 


Met 


Trp 


Val 


Thr Val Gly Leu Tyr 


Leu Gin Glu Gly His 




85 




90 






95 




Lys Val Pro 


Gin 


Phe 


His 


Gly Lys Trp Pro Phe 


Ser 


Arg Phe 


Leu 


Phe 


100 






105 




110 






Phe Gin Glu 


Pro 


Ala 


Ser 


Ala Val Ala Ser Phe 


Leu Asn Gly 


Leu 


Ala 


115 








120 




125 






Ser Leu Val 


Met 


Leu 


Cys 


Arg Tyr Arg Thr Phe 


Val 


Pro Ala 


Ser 


Ser 


130 








135 


140 








Pro Met Tyr His 


Thr 


Cys 


Val Ala Phe Ala Trp 


val 


Ser Leu 


Asn 


Ala 


145 






150 


155 








160 


Trp Phe Trp 


Ser 


Thr 


Val 


Phe His Thr Arg Asp 


Thr Asp Leu 


Thr 


Glu 




165 




170 






175 




Lys Met Asp 


Tyr 


Phe 


Cys 


Ala Ser Thr Val He 


Leu 


His Ser 


He 


Tyr 




180 






185 




190 






Leu Cys Cys 


Val 


Arg Thr Val Gly Leu Gin His 


Pro 


Ala Val 


Val 


Ser 


195 








200 




205 






Ala Phe Arg Ala 


Leu 


Leu 


Leu Leu Met Leu Thr 


Val 


His Val 


Ser 


Tyr 


210 








215 


220 








Leu Ser Leu lie 


Arg 


Phe 


Asp Tyr Gly Tyr Asn 


Leu 


Val Ala 


Asn 


Val 


225 






230 


235 








240 


Ala He Gly 


Leu 


Val 


Asn 


Val Val Trp Trp Leu 


Ala 


Trp Cys 


Leu 


Trp 




245 




250 






255 




Asn Gin Arg Arg 


Leu 


Pro 


His Val Arg Lys Cys 


Val 


Val Val 


Val 


Leu 




260 






265 




270 






Leu Leu Gin Gly 


Leu 


Ser 


Leu Leu Glu Leu Leu 


Asp 


Phe Pro 


Pro 


Leu 


275 








280 




285 






Phe Trp Val 


Leu 


Asp Ala His Ala He Trp His 


He 


Ser Thr 


11c 


Pro 


290 








295 


300 








Val His Val 


Leu 


Phe 


Phe 


Ser Phe Leu Glu Asp Asp Ser Leu 


ryr 


J-ICU 


305 






310 


315 








320 


Leu Lys Glu 


Ser 


Glu 


Asp 


Lys Phe Lys Leu Val 


Glu 


Ala Asp 


xrp 


lie 




325 




330 






335 




Phe Ala Leu 


Pro 


Leu 


Thr 


Pro Cys Pro Ser Leu Arg Glu Gly 


Ser 


Tyr 




340 






345 




350 






Ala Arg Thr 


Pro 


Thr Ser Gly Thr Arg Val Ala Cys 


Ala Ser 


Phe 


Phe 


355 








360 




365 






<210> 85 


















<211> 190 


















<212> PRT 


















<213> Homo 


sapiens 















<400> 8b 
















Leu 


Thr 


Arg 


Ser 


Cys Ser Thr Cys Cys Pro Ala 


Val 


Ala 


Cys 


Leu 


Val 


1 






5 10 








15 




Gly 


Arg 


Gly Val 


Val Thr Ser Gly Ala Met His 


Gin 


Cys 


Trp Gly Glu 








20 


25 






30 






Glu 


Met 


Leu 


Gin 


Gly Met Leu Leu Trp Gly Trp 


Ala 


Thr 


Cys 


Pro 


Leu 






35 




40 




45 








Ser 


Asn 


Pro 


Gly 


Arg Trp Gly Arg Thr Val Gly 


Leu 


Gin 


His 


Pro 


Ala 




50 






55 


60 










Val 


Val 


Ser 


Ala 


Phe Arg Ala Leu Leu Leu Leu 


Met 


Leu 


Thr 


Val 


His 


65 








70 75 










80 


Val 


Ser 


Tyr 


Leu 


Ser Leu He Arg Phe Asp Tyr 


Gly 


Tyr Asn 


Leu 


Val 








85 90 








95 




Ala 


Asn 


Val 


Ala 


He Gly Leu Val Asn Val Val 


Trp 


Trp 


Leu 


Ala 


Trp 








100 


105 






110 






Cys 


Leu 


Trp Asn 


Gin Arg Arg Leu Pro His Val 


Arg 


Lys 


Cys 


Val 


Val 






115 




120 




125 
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Val Val Leu Leu Leu Gin Gly Leu Ser Leu Leu Glu Leu Leu Asp Phe 
Pro Pro Leu Phe Trp Val Leu Asp Ala His Ala He Trp His tie Ser 
Thr He Pro. Val His Val Leu Phe Phe Ser Phe Leu Glu Asp Asp Jer 



Leu Tyr Leu Leu Lys Glu Ser Glu Asp Lys Phe Lys Leu Asp 
180 185 



175 

<210> 86 - LHS 190 

<211> 318 
<212> PRT 
<213> Homo sapiens 



<400> 86 

Met Ala Gly Leu Ala Ala Arg Leu Val Leu Leu Ala Gly Ala Ala Ala 
Leu Ala Ser Gly Ser Gin Gly Asp Arg gL Pro Val Tyr Arg Asp Cys 
Val Leu Gin Cys Glu Glu Gin Asn Cys Ser Gly Gly Ala Leu Asn His 
Phe Arg Ser Arg Gin Pro lie Tyr Met Ser Leu Ala G fy Trp Thr Cys 
Arg Asp Asp cys Lys Tyr Glu Cys Met Trp Val Sr Val Gly Leu Tyr 



35 -™ WJ -J r U / B fro File 

Ser Arg Phe Leu Phe Phe Gin Glu Pro Ala Ser Ala Val Ala 2er Phe 
Leu Asn Gly Leu Ala Ser Leu Val Met Leu Cys Arg Tyr Arg Thr Phe 
Val Pro Ala Ser Ser Pro Met Tyr His Thr Cys Val £a Phe Ala Trp 
Val ser Leu Asn Ala Trp Phe Trp Ser Thr Val Phe His Thr Arg Asp 
Thr Asp Leu Gin Arg Lys Trp Thr Thr Ser HI Pro Pro Val Ser Tyr 
Thr Gin Ser Thr Cys Ala Ala Ser Gly Pro Trp Gly Cys Ser Thr Gin 

Leu Trp ser Ser Ala Phe Arg Ala Leu Leu Leu Leu Met III Thr Val 

200 

His Val Ser Tyr Leu Ser Leu lie Arg Phe Asp Tyr Gly Tyr Asn Leu 

Val Ala Asn Val Ala lie Gly Leu Val Asn Val vll Trp Trp Leu Ala 

Trp Cys Leu Trp Asn Gin Arg Arg Leu Pro His Val Arg Lys Cys III 

Val Val Val Leu Leu Leu Gin Gly Leu Ser Leu Leu Glu Leu III Asp 

Phe Pro Pro Leu Phe Trp Val Leu As| Ala His Ala He Trp" His lie 

Ser Thr He Pro Val His Val Leu Phe Phe Ser Phe III Glu Asp 

293 300 
Ser Leu Tyr Leu Leu Lys Glu Ser Glu Asp Lys Phe Lys Leu 

<210> 87 310 315 

<211> 226 
<212> PRT 



<213> Homo sapiens 



BHC03 1 051 



-96- 



Met°Ala 8 Gly Leu Ala Ala Arg Leu Val Leu Leu Ala Gly Ala Ala Ala 

1 5 10 15 

Leu Ala Ser Gly Ser Gin Gly Asp Arg Glu Pro Val Tyr Arg Asp Cys 

20 25 30 

Val Leu Gin Cys Glu Glu Gin Asn Cys Ser Gly Gly Ala Leu Asn His 

35 " 40 4S 

Phe Arg Ser Arg Gin Pro He Tyr Met Ser Leu Ala Gly Trp Thr Cys 

50 55 60 

Arg Asp Asp Cys Lys Tyr Glu Cys Met Trp Val Thr Val Gly Leu Tyr 
65 ~ 70 75 80 

Leu Gin Glu Gly His Lys Val Pro Gin Phe His Gly Lys Trp Pro Phe 

85 90 . ... 95 

Ser Arg Phe Leu Phe Phe Gin Glu Pro Ala Ser Ala Val Ala Ser Phe 

100 105 110 

Leu Asn Gly Leu Ala Ser Leu Val Met Leu Cys Arg Tyr Arg Thr Phe 

115 120 12S 

Val Pro Ala Ser Ser Pro Met Tyr His Thr Cys Val Ala Phe Ala Trp 

130 135 140 

Val Ser Leu Asn Ala Trp Phe Trp Ser Thr Val Phe His Thr Arg Asp 
145 150 155 160 

Thr Asp Leu Thr Glu Lys Met Asp Tyr Phe Cys Ala Ser Thr Val He 

165 170 175 

Leu His Ser lie Tyr Leu Cys Cys Val Arg Pro Gly Gin Arg Gly Val 

180 185 190 

Val Ala Gly Leu Val Pro Val Glu Pro Ala Ala Ala Ala Ser Arg Ala 

1 95 200 205 

Gin Val Arg Gly Gly Gly Leu Ala Ala Ala Gly Ala Val Pro Ala Arg 
210 215 220 



Ala Ala 


225 




<210> 


88 


<211> 


320 


<212> 


PRT 


<213> 


Homo 



Met Ala Gly Leu Ala Ala Arg Leu Val Leu Leu Ala Gly Ala Ala Ala 

1 5 10 15 

Leu Ala Ser Glv Ser Gin Gly Asp Arg Glu Pro Val Tyr Arg Asp Cys 

20" 25 -*0 

Val Leu Gin Cys Glu Glu Gin Asn Cys Ser Gly Gly Ala Leu Asn His 

35 40 45 

Phe Arg Ser Arg Gin Pro He Tyr Met Ser Leu Ala Gly Trp Thr Cys 

50 55 60 

Arg Asp Asp Cys Lys Tyr Glu Cys Met Trp Val Thr Val Gly Leu Tyr 
65 70 75 80 

Leu Gin Glu Gly His Lys Val Pro Gin Phe His Gly Lys Trp Pro Phe 

85 90 95 

Ser Arg Phe Leu Phe Phe Gin Glu Pro Ala Ser Ala Val Ala Ser Phe 

100 105 HO 

Leu Asn Gly Leu Ala Ser Leu Val Met Leu Cys Arg Tyr Arg Thr Phe 

115 120 125 

Val Pro Ala Ser Ser Pro Met Tyr His Thr Cys Val Ala Phe Ala Trp 

130 135 140 

Val Ser Leu Asn Ala Trp Phe Trp Ser Thr Val Phe His Thr Arg Asp 
145 150 155 16° 
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Thr Asp Leu Thr Glu Lys Met Asp Tyr Phe Cys Ala Ser Thr Val lie 

165 170 175 

Leu His Ser lie Tyr Leu Cys Cys Val Arg Thr Val Gly Leu Gin His 

180 185 190 

Pro Ala Val Val Ser Ala Phe Arg Ala Leu Leu Leu Leu Met Leu Thr 

195 200 205 

Val His Val Ser Tyr Leu Ser Leu He Arg Phe Asp Tyr Gly Tyr Asn 
210 215 220 



Leu Val Ala Asn Val Ala He Gly Leu Val Asn Val Val Trp Trp Leu 
225 230 235 240 

Ala Trp Cys Leu Trp Asn Gin Arg Arg Leu Pro His Val Arg Lys Cys 

245 250 255 

Val Val Val Val Leu Leu Leu Gin Gly Leu Ser Leu Leu Glu Leu Leu 

260 265 270 

Asp Phe Pro Pro Leu Phe Trp Val Leu Asp Ala His Ala He Trp His 

275 280 285 

lie Ser Thr lie Pro Val His Val Leu Phe Phe Ser Phe Leu Glu Asp 

290 295 300 

- — « — m,, r Leu Leu i,y S Glu Ser Glu Asp Lys Phe Lys Leu Asp 

310 315 320 



305 




<210> 


89 


<211> 


217 


<212> 


PRT 


<213> 


Homo 



<400> 89 

Ala Pro Pro Pro Ala Ala Ser Gin Gly Glu Arg Met Ala Gly Leu Ala 

x 5 10 15 

Ala Arg Leu Val Leu Leu Ala Gly Ala Ala Ala Leu Ala Ser Gly Ser 

20 25 30 

Gin Gly Asp Arg Glu Pro Val Tyr Arg Asp Cys Val Leu Gin Cys Glu 

35 40 45 

Glu Gin Asn Cys Ser Gly Gly Ala Leu Asn His Phe Arg Ser Arg Gin 

50 55 60 

Pro He Tyr Met Ser Leu Ala Gly Trp Thr Cys Arg Asp Asp Cys Lys 
65 * 70 75 80 

Tyr Glu Cys Met Trp Val Thr Val Gly Leu Tyr Leu Gin Glu Gly His 

85 90 95 

Lys Val Pro Gin Phe His Gly Lys Trp Pro Phe Ser Arg Phe Leu Phe 

100 105 HO 

Phe Gin Glu Pro Ala Ser Ala Val Ala Ser Phe Leu Asn Gly Leu Ala 

115 120 125 

Ser Leu Val Met Leu Cys Arg Tyr Arg Thr Phe Val Pro Ala Ser Ser 

130 135 140 

Pro Met Tyr His Thr Cys Val Ala Phe Ala Trp Val Ser Leu Asn Ala 
145 A 150 155 160 

Tro Phe Trp Ser Thr Val Phe His Thr Arg Asp Thr Asp Leu Thr Glu 

165 170 175 

Lys Met Asp Tyr Phe Cys Ala Ser Thr Val He Leu His Ser He Tyr 

180 185 190 

Leu Cys Cys Val Ser Phe Leu Glu Asp Asp Ser Leu Tyr Leu Leu Lys 

195 200 205 

Glu Ser Glu Asp Lys Phe Lys Leu Asp 

210 * 215 

<210> 90 

<211> 153 

<212> PRT 



<213> Homo sapiens 
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<400> 90 

Met Asn Val Gly Thr Ala His Ser Glu Val Asn Pro Asn Thr Arg Val 

5 10 is 

Met Asn Ser Arg Gly lie Trp Leu Ser Tyr Val Leu Ala lie Gly Leu 

20 25 30 

Leu His lie Val Leu Leu Ser He Pro Phe Val Ser Val Pro Val Val 

35 40 45 

Trp Thr Leu Thr Asn Leu He His Asn Met Gly Met Tyr He Phe Leu 

50 55 SO 

His Thr Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ala 

70 75 " 80 

Arg Leu Leu Thr His Trp Glu Gin Met Asp Tyr Gly Val Gin Phe Thr 

85 - .90 95 

Ala Ser Arg Lys Phe Leu Thr He Thr Pro He Val Leu Tyr Phe Leu 

100 1 05 no 

Thr Ser Phe Tyr Thr Lys Tyr Asp Gin He His Phe Val Leu Asn Thr 

115 120 125 

Val Ser Leu Met Ser Val Leu He Pro Lys Leu Pro Gin Leu His Gly 

130 135 140 

Val Arg He Phe Gly He Asn Lys Tyr 
"5 150 
<210> 91 

<211> 436 

<212> PRT 

<213> Homo sapiens 



<400> 91 

Met Arg Arg Asp Val Asn Gly Val Thr Lys Ser Arg Phe Glu Met Phe 

i 5 io 15 

Ser Asn Ser Asp Glu Ala Val He Asn Lys Lys Leu Pro Lys Glu Leu 

Leu Leu Arg He Phe Ser Phe Leu Asp Val Val Thr Leu c£s Arg Cys 

Ala Gin Val Ser Arg Ala Trp Asn Val Leu Ala Leu tip Gly Ser Asn 

iU 55 60 

Trp Gin Arg He Asp Leu Phe Asp Phe Gin Arg Asp He Glu Gly Arq 

7 ^ 75 an 

Val Val Glu Asn He Ser Lys Arg Cys Gly Gly Phe Leu Arg Lys Leu 

_ 85 90 as 

Ser Leu Arg Gly Cys Leu Gly Val Gly Asp Asn Ala Leu Arg Thr Phe 

100 iOS 2.10 

Ala Gin Asm Cys Arg Asn He Glu Val Leu Asn Leu Asn Gly Cys Thr 

115 120 125 

Lys Thr Thr Asp Ala Thr Cys Thr Ser Leu Ser Lys Phe Cys Ser Lys 

Leu Arg His Leu Asp Leu Ala Ser Cys Thr Ser lie Thr Asn Met Ser 

150 155 16Q 

Leu Lys Ala Leu Ser Glu Gly Cys Pro Leu Leu Glu Gin Leu Asn lie 

165 170 175 

Ser Trp Cys Asp Gin Val Thr Lys Asp Gly He Gin Ala Leu Val Arci 
180 185 i 90 3 

Gly Cys Gly Gly Leu Lys Ala Leu Phe Leu Lys Gly Cys Thr Gin Leu 

195 200 205 

Glu Asp Glu Ala Leu Lys Tyr He Gly Ala His Cys Pro Glu Leu Val 

210 215 220 

Thr Leu Asn Leu Gin Thr Cys Leu Gin He Thr Asp Glu Gly Leu He 
225 230 235 240 
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Thr 


He 


Cys 


Arg 


Gly 
245 


Cys His 


Lys 


Cys 


Ser 


Asn 


He 
260 


Thr 


Asp Ala 


He 


Pro 


Arg 


Leu 
275 


Arg 


He 


Leu Glu 


Val 
280 


Val 


Gly 
290 


Phe 


Thr 


Thr 


Leu Ala 
295 


Arg 


Asp 


Leu 


Glu 


Glu 


Cys 


Val Gin 


He 


305 










310 




Ser 


He 


His 


Cys 


Pro 
325 


Arg Leu 


Gin 


Leu 


He 


Thr 


Asp 


Asp 


Gly He Arg 








340 








His 


Asp 


Gin 
355 


Leu 


Glu 


Val He 


Glu 
360 


Asp 


Ala 
370 


Ser 


Leu 


Glu 


His Leu 
375 


Lys 


Glu 


Leu 


Tyr 


Asp 


Cys 


Gin Gin 


He 


385 










390 




Arg 


Thr 


His 


Leu 


Pro 
405 


Asn He 


Lys 


Thr 


Pro 


Pro 


Pro 


Ser 


Val Gly Gly 








420 








Cys 


He 


He 
435 


Leu 









<210> 92 

<211> 204 

<212> PRT 

<213> Homo sapiens 



<400> 92 



Met 


Asp 


Pro 


Lys 


Asp 


Arg 


Lys 


Lys 


1 








5 








Pro 


Ser 


Gin 


Leu 


Asp 


Pro Arg 


Gin 








20 










Pro 


Thr 


Pro 


Ala 


Met 


Leu 


Phe 


Arg 






35 










40 


Glu 


Glu 


Ala 


Ser 


Pro 


His 


Gin Arg 




50 










55 




Lys 


Ser 


Lys 


Arg 


Pro 


Asn 


Pro 


Cys 


65 










70 






Ala 


Val 


Gin 


Arg 


He 


Ala 


Glu 


Ser 










85 








Asn 


Glu 


Asn 


Gin 


Ala 


Ser 


Glu 


Glu 








100 










Glu 


Leu 


Gly 


Tyr 


Pro 


Arg 


Glu 


Glu 






115 










120 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 




130 










135 




He 


Arg 


Gin 


Ser 


Ala 


Gly Gin Lys 


145 










150 






Gly 


Pro 


Trp 


Glu Arg 


Pro 


Pro 


Pro 










165 








Gly 


Ser 


Glu 


Asp 


Gin 


Val 


Glu 


Asp 








180 










Glu 


Pro 


Gin 


Arg 


Pro 


Ser 


Pro 


Ser 






195 








200 



<210> 93 



<211> 115 
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Leu 


Gin 


Ser 


Leu 


Cys 


Ala 


Ser 


Gly 




250 








255 


Leu 


Asn 


Ala 


Leu 


Gly 


Gin 


Asn 


Cys 


265 








270 






Ala 


Arg 


Cys 


Ser 


Gin 


Leu 


Thr 


Asp 










285 








Asn 


Cys 


His 


Glu 


Leu 


Glu 


Lys 


Met 








300 










Thr 


Asp 


Ser 


Thr 


Leu 


He 


Gin 


Leu 






315 










320 


Val 


Leu 


Ser 


Leu 


Ser 


His 


Cys 


Glu 




330 










335 




His 


Leu 


Gly 


Asn 


Gly 


Ala 


Cys 


Ala 


345 










350 






Leu 


Asp 


Asn 


Cys 


Pro 


Leu 


He 


Thr 










365 








Ser 


Cys 


His 


Ser 


Leu 


Glu 


Arg 


He 








380 










Thr 


Arg 


Ala 


Gly 


He 


Lys 


Arg 


Leu 






395 










400 


Val 


His 


Ala 


Tyr 


Phe 


Ala 


Pro 


Val 




410 










415 




Ser 


Arg 




Arg 




Cys 


Arg 


Cys 


425 










430 






He 


Gin 


Phe 


Ser 


Val 


Pro 


Ala 


Pro 




10 










15 




Val 


Glu 


Met 


He 


Arg 


Arg 


Arg 


Arg 


25 










30 






Leu 


Ser 


Glu 


His 


Ser 


Ser 


Pro 


Glu 










45 








Ala 


Ser 


Gly 


Glu 


Gly 


His 


His 


Leu 








60 










Ala 


Tyr 


Thr 


Pro 


Pro 


Ser 


Leu 


Lys 






75 










80 


His 


Leu 


Gin 


Ser 


He 


Ser 


Asn 


Leu 




90 










95 




Glu 


Asp 


Glu 


Leu 


Gly 


Glu 


Leu 


Arg 


105 










110 






Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Asp 










125 








Ser 


Gin 


Ala 


Glu 


Val 


Leu 


Lys 


Val 








140 










Thr 


Thr 


Cys 


Gly 


Gin 


Gly 


Leu 


Glu 






155 










160 


Leu 


Asp 


Glu 


Ser 


Glu 


Arg 


Asp 


Gly 




170 










175 




Pro 


Ala 


Leu 


Ser 


Glu 


Pro 


Gly 


Glu 


185 










190 




Glu 


Pro 


Gly 


Thr 
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<212> PRT 

<213> Homo sapiens 



<400> 93 



Met 


Ser 


Gly Glu 


Pro Gly 


Gin Thr Ser Val 


Ala 


Pro 


Pro 


Pro 


Glu 


Glu 


1 






5 


10 










15 




Val 


Glu 


Pro Gly 
20 


Ser Gly 


Val Arg He Val 
25 


Val 


Glu 


Tyr 


Cys 
30 


Glu 


Pro 


Cys 


Gly 


Phe Glu 


Ala Thr 


Tyr Leu Glu Leu 


Ala 


Ser 


Ala 


Val 


Lys 


Glu 






35 




40 






45 






Gin 


Tyr 


Pro Gly 


He Glu 


He Glu Ser Arg Leu Gly Gly Thr Gly Ala 




50 






55 - 




60 










Phe 


Glu 


He Glu 


He Asn 


Gly Gin Leu Val 


Phe 


Ser 


Lys 


Leu 


Glu 


Asn 


65 






70 




75 








80 


Gly 


Gly 


Phe Pro 


Tyr Glu 


Lys Asp Leu He 


Glu 


Ala 


He 


Arg 


Arg 


Ala 








85 


90 








95 




Ser 


Asn 


Gly Glu 


Thr Leu 


Glu Lys He Thr 


Asn 


Ser 


Arg 


Pro 


Pro 


Cys 






100 




105 






110 




Val 


He 


Leu 
115 



















<210> 94 



<211> 144. 
<212> PRT 
<213> Homo sapiens 



<400> 94 






Met Gly 


Ala 


Val Val Leu 


Cys Arg 


1 




5 




Gin Thr 


Gly Thr Gly Gin 


Gly Leu 






20 




Cys Glu 


Ala 


Thr Pro Cys 


Gly Val 




35 




40 


Leu Leu 


Lys 


Gin His Arg 


Gly Arg 


50 






55 


Val Ser 


Ala 


Cys Arg Glu 


Glu Ser 


65 




70 




Trp Ser 


Leu 


Leu Pro Ser 


Pro Val 






85 




Lys Arg 


Cys 


Gly Ser Leu 


Cys Pro 






100 




Arg Gly 


His 


Trp Ala Cys 


Phe Leu 




115 




12 0 


Pro Cys 


He 


He Gly Asn 


Phe His 


130 






135 


<210> 95 






<211> 425 






<212> PRT 






<213> Homo 


sapiens 





Pro Ser Pro Leu Asn Phe Leu He 

10 15 
Ser Cys Gly Ser His Met Trp Arg 
25 30 
Cys Gly Glu Ser Pro Val Gly Ser 
45 

Gly Lys Thr Trp Pro Val Gly Thr 
60 

Glu Ala Gly Ser Leu Ser Leu Gly 

75 80 
Gly Leu Gly Ala Val Leu He Leu 

90 95 
Leu Pro Gly Val Gin Gly Asn Arg 
105 110 
Pro Pro Asp Pro Ala Ser Pro Thr 
125 

Leu Lys He Phe Leu Ser Lys Val 
140 



<400> 95 



Met Gly Gly Gly Asp Leu Asn Leu Lys Lys Ser Trp His Pro Gin Thr 
15 10 is 



BHC03 1 0 51 



-101- 

Leu Arg Asn Val Glu Lys Val Trp Lys Ala Glu Gin Lys His Glu Ala 

20 25 30 

Glu Arg Lys Lys lie Glu Glu Leu Gin Arg Glu Leu Arg Glu Glu Arg 

35 40 45 

Ala Arg Glu Glu Met Gin Arg Tyr Ala Glu Asp Val Gly Ala Val Lys 

50 55 60 

Lys Lys Glu Glu Lys Leu Asp Trp Met Tyr Gin Gly Pro Gly Gly Met 
65 70 75 80 

Val Asn Arg Asp Glu Tyr Leu Leu Gly Arg Pro lie Asp Lys Tyr Val 

85 90 95 

Phe Glu Lys Met Glu Glu Lys Glu Ala Gly Cys Ser Ser Glu Thr Gly 

100 105 110 

Leu Leu Pro Gly Ser He Phe Ala Pro Ser Gly Ala Asn Ser Leu Leu 

115 120 125 

Asp Met Ala Ser Lys He Arg Glu Asp Pro Leu Phe lie He Arg Lys 

130 135 140 

Lys Glu Glu Glu Lys Lys Arg Glu Val Leu Asn Asn Pro Val Lys Met 
145 150 155 160 

Lys Lys He Lys Glu Leu Leu Gin Met Ser Leu Glu Lys Lys Glu Lys 

165 170 175 

Lys Lys Lys Lys Glu Lys Lys Lys Lys His Lys Lys His Lys His Arg 

180 185 190 

Ser Ser Ser Ser Asp Arg Ser Ser Ser Glu Asp Glu His Ser Ala Gly 

195 200 205 

Arg Ser Gin Lys Lys Met Ala Asn Ser Ser Pro Val Leu Ser Lys Val 

21 0 215 220 

Pro Gly Tyr Gly Leu Gin Val Arg Asn Ser Asp Arg Asn Gin Gly Leu 
225 230 235 240 

Gin Gly Pro Leu Thr Ala Glu Gin Lys Arg Gly His Gly Met Lys Asn 

245 250 " 255 

His Ser Arg Ser Arg Ser Ser Ser His Ser Pro Pro Arg His Ala Ser 

260 265 270 

Lys Lys Ser Thr Arg Glu Ala Gly Ser Arg Asp Arg Arg Ser Arg Ser 

275 280 285 

Leu Gly Arg Arg Ser Arg Ser Pro Arg Pro Ser Lys Leu His Asn Ser 

290 295 300 

Lys Val Asn Arg Arg Glu Thr Gly Gin Thr Arg Ser Pro Ser Pro Lys 
305 310 315 320 

Lys Glu Val Tyr Gin Arg Arg His Ala Pro Gly Tyr Thr Arg Lys Leu 

325 330 335 

Ser Ala Glu Glu Leu Glu Arg Lys Arg Gin Glu Met Met Glu Asn Ala 

340 345 350 

Lys Trp Arg Glu Glu Glu Arg Leu Asn He Leu Lys Arg His Ala Lys 

355 360 365 

Asp Glu Glu Arg Glu Gin Arg Leu Glu Lys Leu Asp Ser Arg Asp Gly 

370 375 380 

Lys Phe He His Arg Met Lys Leu Glu Ser Ala Ser Thr Ser Ser Leu 
3 & 5 390 395 400 

Glu Asp Arg Val Lys Arg Asn He Tyr Ser Leu Gin Arg Thr Ser Val 

405 410 415 

Ala Leu Glu Lys Asn Phe Met Lys Arg 
420 425 

<210> 96 

<211> 394 

<212> PRT 

<213> Homo sapiens 



<400> 96 

Met Phe Ser Val Phe Glu Glu He Thr Arg He Val Val Lys Glu Met 
1 5 10 is 
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Asp Ala Gly Gly Asp Met He Ala Val Arg Ser Leu Val Asp Ala Asp 

20 25 30 

Arg Phe Arg Cys Phe His Leu Val Gly Glu Lys Arg Thr Phe Phe Gly 

35 40 45 

Cys Arg His Tyr Thr Thr Gly Leu Thr Leu Met Asp He Leu Asp Thr 

50 55 60 

His Gly Asp Lys Trp Leu Asp Glu Leu Asp Ser Gly Leu Gin Gly Gin 
65 70 75 80 

Lys Ala Glu Phe Gin He Leu Asp Asn Val Asp Ser Thr Gly Glu Leu 

85 90 95 

He Val Arg Leu Pro Lys Glu He Thr He Ser Gly Ser Phe Gin Gly 

100 105 HO 

Phe His His Gin Lys He Lys He Ser Glu Asn Arg He Ser Gin Gin 

115 120 125 

Tyr Leu Ala Thr Leu Glu Asn Arg Lys Leu Lys Arg Glu Leu Pro Phe 

130 135 140 

Ser Phe Arg Ser He Asn Thr Arg Glu Asn Leu Tyr Leu Val Thr Glu 
145 150 155 160 

Thr Leu Glu Thr Val Lys Glu Glu Thr Leu Lys Ser Asp Arg Gin Tyr 

165 * 170 175 

Lys Phe Trp Ser Gin He Ser Gin Gly His Leu Ser Tyr Lys His Lys 

180 185 190 

Gly Gin Arg Glu Val Thr He Pro Pro Asn Arg Val Leu Ser Tyr Arg 

195 200 205 

Val Lys Gin Leu Val Phe Pro Asn Lys Glu Thr Met Arg Lys Ser Leu 

210 215 220 

Gly Ser Glu Asp Ser Arg Asn Met Lys Glu Lys Leu Glu Asp Met Glu 
225 230 235 240 

Ser Val Leu Lys Asp Leu Thr Glu Glu Lys Arg Lys Asp Val Leu Asn 

245 250 255 

Ser Leu Ala Lys Cys Leu Gly Lys Glu Asp He Arg Gin Asp Leu Glu 

260 265 270 

Gin Arg Val Ser Glu Val Leu He Ser Gly Glu Leu His Met Glu Asp 

275 280 285 

Pro Asp Lys Pro Leu Leu Ser Ser Leu Phe Asn Ala Ala Gly Val Leu 

290 295 300 

Val Glu Ala Arg Ala Lys Ala He Leu Asp Phe Leu Asp Ala Leu Leu 
305 310 315 320 

Glu Leu Ser Glu Glu Gin Gin Phe Val Ala Glu Ala Leu Glu Lys Gly 

325 330 335 

Thr Leu Pro Leu Leu Lys Asp Gin Val Lys Ser Val Met Glu Gin Asn 

340 ^ 345 350 

Trp Asp Glu Leu Ala Ser Ser Pro Pro Asp Met Asp Tyr Asp Pro Glu 

355 360 365 

Ala Arg He Leu Cys Ala Leu Tyr Val Val Val Ser He Leu Leu Glu 

370 375 380 

Leu Ala Glu Gly Pro Thr Ser Val Ser Ser 

390 



385 




<210> 


97 


<211> 


456 


<212> 


PRT 


<213> 


Homo 



<400> 97 

Met Glu Gly Pro Glu Gly Leu Gly 
1 5 
Leu Leu His Phe Leu Asp Thr Tyr 
20 

Gly Ala Gly His He He Lys Asp 
35 40 



Arg Lys Gin Ala Cys Leu Ala Met 

10 15 
Gin Gly Leu Leu Gin Glu Glu Glu 
25 30 
Leu Tyr Leu Leu He Met Lys Asp 
45 
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Glu Ser Leu Tyr Gin Gly Leu Arg Glu Asp Thr Leu Arg Leu His Gin 

50 55 60 

Leu Val Glu Thr Val Glu Leu Lys lie Pro Glu Glu Asn Gin Pro Pro 

70 75 80 

Ser Lys Gin Val Lys Pro Leu Phe Arg His Phe Arg Arg lie Asp Ser 

85 90 95 

Cys Leu Gin Thr Arg Val Ala Phe Arg Gly Ser Asp Glu lie Phe Cys 

100 105 no 

Arg Val Tyr Met Pro Asp His Ser Tyr Val Thr He Arg Ser Arg Leu 

115 120 125 

Ser Ala Ser Val Gin Asp He Leu Gly Ser Val Thr Glu Lys Leu Gin 

130 135 140 

Tyr Ser Glu Glu Pro Ala Gly Arg Glu Asp Ser Leu He Leu Val Ala 

150 155 16Q 

Val Ser Ser Ser Gly Glu Lys Val Leu Leu Gin Pro Thr Glu Asp Cys 

16 5 170 175 

Val Phe Thr Ala Leu Gly lie Asn Ser His Leu Phe Ala Cys Thr Arc* 

"0 185 iSo 

Asp Ser Tyr Glu Ala Leu Val Pro Leu Pro Glu Glu He Gin Val Ser 

195 200 205 

Pro Gly Asp Thr Glu He His Arg Val Glu Pro Glu Asp Val Ala Asn 

210 215 220 

His Leu Thr Ala Phe His Trp Glu Leu Phe Arg Cys Val His Glu Leu 

230 235 240 

Glu Phe Val Asp Tyr Val Phe His Gly Glu Arg Gly Arg Arg Glu Thr 

245 250 255 

Ala Asn Leu Glu Leu Leu Leu Gin 'Arg Cys Ser Glu Val Thr His Tiro 

260 265 270 

Val Ala Thr Glu Val Leu Leu Cys Glu Ala Pro Gly Lys Arg Ala Gin 
275 ' 280 285 

LyS LyS Phe Ile Lys Ile Ala Leu Cys Lys Gin Asn Gin 

290 295 300 

Asp Leu Leu Ser Phe Tyr Ala Val Val Met Gly Leu Asp Asn Ala Ala 

310 315 320 

Val Ser Arg Leu Arg Leu Thr Trp Glu Lys Leu Pro Gly Lys Phe Lys 

325 330 335 

Asn Leu Phe Arg Lys Phe Glu Asn Leu Thr Asp Pro Cys Arg Asn His 

340 345 350 

Lys Ser Tyr Arg Glu Val He Ser Lys Met Lys Pro Pro Val He Pro 

355 360 365 

Phe Val Pro Leu He Leu Lys Asp Leu Thr Phe Leu His Glu Gly Ser 

370 375 380 

Lys Thr Leu Val Asp Gly Leu Val Asn He Glu Lys Leu His Ser Val 

390 395 400 

Ala Glu Lys Val Arg Thr He Arg Lys Tyr Arg Ser Arg Pro Leu Cys 

405 410 ~ 4i 5 

Leu Asp Met Glu Ala Ser Pro Asn His Leu Gin Thr Lys Ala Tyr Val 

420 425 43 0 

Arg Gin Phe Gin Val He Asp Asn Gin Asn Leu Leu Phe Glu Leu Ser 

435 440 445 

Tyr Lys Leu Glu Ala Asn Ser Gin 

450 455 
<210> 98 

<211> 715 

<212> PRT 

<213> Homo sapiens 
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Ser Leu Ala Pro Cys Asp Glu Pro Arg Arg Thr Leu His Pro Ala Pro 

20 ' 25 30 

Ser Pro Ser Leu Pro Pro Gin Cys Ser Tyr Tyr Thr Thr Glu Gly Trp 

35 40 45 

Gly Ala Gin Ala Leu Met Ala Pro Val Pro Cys Met Gly Pro Pro Gly 

50 55 60 

Arg Leu Gin Gin Ala Pro Gin Val Glu Ala Lys Ala Thr Cys Phe Leu 
65 70 75 80 

Pro Ser Pro Gly Glu Lys Ala Leu Gly Thr Pro Glu Asp Leu Asp Ser 

85 ~ 90 95 

Tyr lie Asp Phe Ser Leu Glu Ser Leu Asn Gin Met lie Leu Glu Leu 

100 105 HO 

Asp Pro Thr Phe Gin Leu Leu Pro Pro Gly Thr Gly Gly Ser Gin Ala 

115 120 125 

Glu Leu Ala Gin Ser Thr Met Ser Met Arg Lys Lys Glu Glu Ser Glu 

130 135 140 

Ala Leu Asp He Lys Tyr He Glu Val Thr Ser Ala Arg Ser Arg Cys 
145 150 155 160 

His Asp Trp Pro Gin His Cys Ser Ser Pro Ser Val Thr Pro Pro Phe 

165 170 175 

Gly Ser Pro Arg Ser Gly Gly Leu Leu Leu Ser Arg Asp Val Pro Arg 

180 185 190 

Glu Thr Arg Ser Ser Ser Glu Ser Leu He Phe Ser Gly Asn Gin Gly 

195 200 205 

Arg Gly His Gin Arg Pro Leu Pro Pro Ser Glu Gly Leu Ser Pro Arg 

210 215 220 

Pro Pro Asn Ser Pro Ser He Ser He Pro Cys Met Gly Ser Lys Ala 
225 230 235 240 

Ser Ser Pro His Gly Leu Gly Ser Pro Leu Val Ala Ser Pro Arg Leu 

245 250 255 

Glu Lys Arg Leu Gly Gly Leu Ala Pro Gin Arg Gly Ser Arg He Ser 

260 265 270 

Val Leu Ser Ala Ser Pro Val Ser Asp Val Ser Tyr Met Phe Gly Ser 

275 280 285 

Ser Gin Ser Leu Leu His Ser Ser Asn Ser Ser His Gin Ser Ser Ser 

290 295 300 

Arg Ser Leu Glu Ser Pro Ala Asn Ser Ser Ser Ser Leu His Ser Leu 
305 310 315 320 

Gly Ser Val Ser Leu Cys Thr Arg Pro Ser Asp Phe Gin Ala Pro Arg 

325 330 335 

Asn Pro Thr Leu Thr Met Gly Gin Pro Arg Thr Pro His Ser Pro Pro 

340 345 350 

Leu Ala Lys Glu His Ala Ser He Cys Pro Pro Ser He Thr Asn Ser 

355 360 365 

Met Val Asp He Pro He Val Leu He Asn Gly Cys Pro Glu Pro Gly 

370 375 380 

Ser Ser Pro Pro Gin Arg Thr Pro Gly His Gin Asn Ser Val Gin Pro 
^85 390 395 400 

Gly Ala Ala Ser Pro Ser Asn Pro Cys Pro Ala Thr Arg Ser Asn Ser 

405 410 415 

Gin Thr Leu Ser Asp Ala Pro Phe Thr Thr Cys Pro Glu Gly Pro Ala 

420 425 430 

Arg Asp Met Gin Pro Thr Met Lys Phe Val Met Asp Thr Ser Lys Tyr 

435 440 445 

Trp Phe Lys Pro Asn He Thr Arg Glu Gin Ala He Glu Leu Leu Arg 

450 455 460 

Lys Glu Glu Pro Gly Ala Phe Val He Arg Asp Ser Ser Ser Tyr Arg 
465 470 475 480 

Gly Ser Phe Gly Leu Ala Leu Lys Val Gin Glu Val Pro Ala Ser Ala 

485 490 495 

Gin Asn Arg Pro Gly Glu Asp Ser Asn Asp Leu He Arg His Phe Leu 

500 505 510 

He Glu Ser Ser Ala Lys Gly Val His Leu Lys Gly Ala Asp Glu Glu 

515 520 525 

Pro Tyr Phe Gly Ser Leu Ser Ala Phe Val Cys Gin His Ser He Met 
530 535 540 
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Ala Leu Ala Leu Pro Cys Lys Leu Thr lie Pro Gla Glu Leu Qly 

Gly Ala Asp Gly Ala Ser Asp Ser Thr Asp Ser Pro Ala Ser Cys Gin 

565 570 575 

Lys Lys Ser Ala Gly Cys His Thr Leu Tyr Leu Ser Ser Val Ser Val 

580 585 590 

Glu Thr Leu Thr Gly Ala Leu Ala Val Gin Lys Ala lie Ser Thr Thr 

595 600 505 

Phe Glu Arg Asp He Leu Pro Thr Pro Thr Val Val His Phe Glu Val 

. 610 615 620 

Thr Glu Gin Gly He Thr Leu Thr Asp Val Gin Arg Lys Val Phe Phe 

630 635 ~ 64Q 

Arg Arg His Tyr Pro Leu Thr Thr Leu Arg Phe Cys Gly Met Asp Pro 

, 645 650 655 

Glu Gin Arg Lys Trp Gin Lys Tyr Cys Lys Pro Ser Trp He Phe Glv 

660 665 670 

Phe Val Ala Lys Ser Gin Thr Glu Pro Gin Glu Asn Val Cys His Leu 

675 680 685 

Phe Ala Glu Tyr Asp Met Val Gin Pro Ala Ser Gin Val He Gly Leu 

690 695 700 

Val Thr Ala Leu Leu Gin Asp Ala Glu Arg Met 
705 710 715 

<210> 99 

<211> 35 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 99 

ccatatataa aaccactgtc ctgtcctttg tggct 

<210> 100 ya 

<211> 26 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 100 

cccccatctg tctgtctata tttgtc 

<210> 101 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 101 
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tgcctacgct gacgactatg tg 
<210> 102 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 102 

tttggttttc tacaactgtt gctat 
<210> 103 



<211> 19 



<212> DNA 



<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 103 

gggctccaca caccagatg 
<210> 104 



<211> 21 



<212> DNA 



<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 104 

acgctctgag caccctctac a 

<210> 105 

<211> 31 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 105 

tgtcacaggg actgaaaacc tctcctcatg 
<210> 106 



<211> 17 



<212> DNA 
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<213> Artificial sequence 



<220> 

<223> PGR primer 

<400> 106 
cccaaggcca cgagctt 

<210> 107 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 107 

tgttgctctc ttaacgaatc gaaa 

<210> 108 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 108 

ctggtcaaac aaactctctg aacccctcc 

<210> 109 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 109 

tggtgaggaa aagcggacat 

<210> 110 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> PCR primer 

<40O> 110 

ctggcttgga ggacagtgaa g 

<210> 111 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 111 

ccaagccctc cccatcccat gtat 

<210> 112 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 112 

gaggtgtcgt accgcgttct a 

<210> 113 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 113 

ccgttctgct cttccctgtc t 

<210> 114 

<211> 23 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 114 

ccagacccgc ttcactgacc tgc 
<210> 115 
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<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

i 

<223> PCR primer 
<400> 115 

cgcctgtact tcagcatgga 20 
<210> 116 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 116 

gcggttcagc tggtggaa 18 

<210> 117 

<211> 25 ' 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 117 

accccgaggc atcaccacaa atcat 25 

<210> 118 

<211> 23 

<212> DNA 



<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 118 

agttctgcct ctctgacaac cat 23 

<210> 119 

<211> 23 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> PCR primer 

<400> 119 

taggctcaga gtcagaccca aac 

<210> 120 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 120 

ccctcgtggg cttgtgctcg g 

<210> 121 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 121 

aagccgccag ttcatctttt t 

<210> 122 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 122 

cttgtggttc aagtcaaatg ttcag 

<210> 123 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 123 
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tctgcctgcg ctctcgtcgg t 
<210> 124 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 124 
gggctgggca cctgactt 

<210> 125 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 125 

cccaacaagg gtcccagact 

<210> 126 

<211> 17 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 126 
cggcgcattg agcggcg 

<210> 127 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 127 

cccaagggac ttcgtgaatg 

<210> 128 

<211> 21 



<212> DNA 
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<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 128 

ggcgatccct gatgacaagt a 

<210> 129 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer# 

<400> 129 

agcaccaact gtgaaccagg tacaatggc 

<210> 130 

<211> 19 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 130 

gagggaggct ctgctttgg 

<210> 131 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 131 

tcacaactag cgggtgagga g 

<210> 132 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> PCR primer 

<400> 132 

tgcagaggaa cggcgtgagc g 

<210> 133 

<211> 22 

<212> DNA 

<213> /Artificial sequence 



<220> 

<223> PCR primer 

<400> 133 

tgaggtttcc tcccaaatcg ta 

<210> 134 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 134 

cagctcaagg gaagctgtca tc 

<210> 135 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 135 

cccccacatg ttccccaaga tgct 

<210> 136 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 136 

ggaggcgcta aaggtctacg t 
<210> 137 
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<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 137 

tgatgcttcg caggtcagta a 

<210> 138 

<2il> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 138 

ctcctgcccc tcctaaagct gaagcc 

<210> 139 

<211> 17 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 139 
ggacgcgtgg gcttttc 

<210> 140 

<211> 20 

<2V>> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 140 

tgtggctgtg gacacctttc 

<210> 141 

<211> 25 

<212> DNA 



<213> Artificial sequence 
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<220> 

<223> PGR primer 

<400> 141 

ccacaagctg aaggcagaca aggcc 

<210> 142 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 142 

gcggattctc atggaacaca 

<210> 143 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 143 

ggtcagccag gagcttcttg 

<210> 144 

<211> 23 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 144 

accaccttgc gcaggttgtc cag 

<210> 145 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 145 



cgcatgcacg acctgaac 
<210> 146 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 146 

gtctcgatct tggacagctt ctg 

<210> 147 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 147 

acactgtcca cacggcccga gg 

<210> 148 ~ 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 148 

ctgggcagaa tggaaggatc t 
<210> 149 



<211> 22 



<212> DNA 

<213> Artificial sequence 



<220> 



<223> pcr primer 
<400> 149 

gggactctag cagacccaca ct 
<210> 150 

<211> 22 



<212> DNA 
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<213> 


Artificial sequence 


<220> 




<223> PCR primer 
<400> 150 

cacccacctg gattccctgt tc 
<210> 151 


<211> 


23 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> PCR primer 
<400> 151 

ccttcagaca ggcgtagatg atg 
<210> 152 


<211> 


29 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 
<40O> 


. PCR primer 
152 



gggtattatt tctttattag gtgccactt 
<210> 153 

<211> 30 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 153 

ttccctaagg ctttcagtac ccaggatctg 

<210> 154 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> PCR primer 

<400> 154 
ccagcttggc cctttcct 

<210> 155 

<211> 23 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 155 

gaatgggtcg cttttgttct tag 

<210> 156 

<211> 22 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 156 

tcacggacct cagcctgccc ct 

<210> 157 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 157 

tggtgaaggt gtcagccatg t 

<210> 158 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 158 

tcagagtgca gcaatggctt t 
<210> 159 
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<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 
<400> 159 

acctccttcc ccagctcccc 
<210> 160 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 160 

ggcaacatct tacttgtcct ttga 

<210> 161 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 161 

ccaaggaagc acagacaact atttc 
<210> 162 

<211> 30 

<212> DNA 



<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 162 

tcctccctat ccatggcact aaaccacttc 
<210> 163 

<211> 19 

<212> DNA 



<213> Artificial sequence 
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<220> 

<223> PCR primer 

<400> 163 

tgggcaaggg ctcctatct 

<210> 164 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 164 

gttacccctg gcagacgtat g 

<210> 165 

<211> 31 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 165 

tgcctctgag tctgaatctc ccaaagagag 

<210> 166 

<211> 31 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 166 

gagtagttat gtgattattt cagctcttga 

<210> 167 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 167 
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tcaaatgttg tccccgagtc t 
<210> 168 

<211> 34 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 168 

cagaaattcg gaagacagaa ctattgtcat gcct 

<210> 169 

<211> 27 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 169 

gattagtaac ccatagcagt tgaaggt 

<210> 170 

<211> 26 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 170 

atttactgac ggtggtctga acatac 

<210> 171 

<211> 31 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 171 

tgacagactc caaatcacaa gcacagtcaa c 

<210> 172 

<211> 25 

<212> DNA 
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<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 172 

tgatggtttg gaggaaagtt tattt 

<210> 173 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 173 

tttggttggg tctttagagg aatc 

<210> 174 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 174 

tgccaaccat gcatcaggta gccc 

<210> 175 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 175 

cagctcacct ggcaacttca 

<210> 176 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> PCR primer 

<400> 176 

cctgattttc ccagcgatgt 

<210> 177 

<211> 19 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 177 

cgccgctccc ggttctgct 

<210> 178 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 

<400> 178 

tggccaagcg taagctgatt 

<210> 179 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 



<223> PCR primer 
<400> 179 

gctgcagtga tcggatcatc t 
<210> 180 

<211> 22 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MLLT6 
<400> 180 

caccatggag cccatcgtgc tg 
<210> 181 



BHC03 1 051 



-124- 

<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> MLLT6 for 
<400> 181 

atccccgagg tgcaatttg 
<210> 182 



<211> 21 



<212> DNA 



<213> Artificial Sequence 




<220> 

<223> MLLT6 rev 
<400> 182 

agcgatcatg aggcacgtac t 
<210> 183 



<211> 29 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> ZNF144 
<400> 183 

cctgccagag ataggagacc cagacagct 

<210> 184 ^ 
<211> 19 



<213> Artificial Sequence 



<220> 

<223> ZNF144 for 

<400> 184 

atccccctga gccttttca 

<210> 185 

<211> 19 

<212> DNA 



<213> Artificial Sequence 



BHU 03 1 051 



<220> 

<223> ZNF144 rev 

<400> 185 

cagcctctgg tcccaccat 

<210> 186 

<211> 28 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PIP5K2B 

<400> 186 

tgatcatcaa ttccaaacct ctcccgaa 

<210> 187 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PIP5K2B for 

<400> 187 

ccccatggtg ttccgaaac 

<210> 188 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PIP5K2B rev 

<400> 188 

tgccaggagc ctccatacc 

<210> 189 

<211> 29 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> TEM7 
<400> 189 



BHC03 1 051 



- 126- 

cagccttcta aaacacaatg tattcatgt 
<210> 190 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> TEM7 for 

<400> 190 

cctgaactta atggtagaat tcaaagatc. 

<210> 191 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> TEM7 rev 

<400> 191 

tattaacact gagaatccat gcagaga 

<210> 192 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ZNFN1A3 

<400> 192 

tatctggtct cagggattgc tcctatgtat tcagc 

<210> 193 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> ZNFN1A3 for 
<400> 193 

cacagagccc tgctgaagtg 
<210> 194 

<211> 23 



<212> DNA 



BHC03 1 051 



<213> Artificial Sequence 



<220> 

<223> ZNFN1A3 rev 

<400> 194 

gcgaggtcat tggtttttag aaa 

<210> 195 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> WIRE 

<400> 195 

ctgtgatccg aaatggtgcc ag 

<210> 196 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> WIRE for 

<400> 196 

ccgtctccac atccaaacct 

<210> 197 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> WIRE rev 

<400> 197 

acccatgcat tcggtatggt 

<210> 198 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 



BHC03 1 051 



<223> PSMB3 

<400> 198 

agtggcacct gcgccgaaca a 

<210> 199 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PSMB3 for 

<400> 199 

ccccatggtg actgatgact t 

<210> 200 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PSMB3 rev 

<400> 200 

ccagagggac tcacacattc c 

<210> 201 

<211> 29 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MGC9753 

<400> 201 

ccagaaactt tccatcccaa aggcagtct 

<210> 202 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MGC9753 for 
<400> 202 

ctgccccaca ggaatagaat g 
<210> 203 



BHC03 1 051 



<211> 23 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> MGC9753 rev 

<400> 203 

aaaaatccag tctgcttcaa cca 

<210> 204 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> ORMDL3 

<400> 204 

agctgcccca gctccacgga 

<210> 205 

<211> 21 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> ORMDL3 for 

<400> 205 

tccctgatga gcgtgcttat c 

<210> 206 

<211> 28 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> ORMDL3 rev 

<400> 206 

tctcagtact tattgattcc aaaaatcc 

<210> 207 

<211> 25 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



BHC03 1 051 



<220> 

<223> MGC15482 

<400> 207 

tccagtggaa gcaaccccag tgttc 

<210> 208 

<211> 25 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> MGC15482 for 

<400> 208 

cacttctaga gctaccgtgg agtct 

<210> 209 

<211> 22 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> MGC15482 rev 

<400> 209 

ccctcacttt gtaacccttg ct 

<210> 210 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> PPP1R1B 

<400> 210 

cagcgtggcg caacaaccca 

<210> 211 

<211> 21 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> PPP1R1B for 
<400> 211 



BHC03 1 051 



gggattgttt cgccacacat a 
<210> 212 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 



<223> PPP1R1B rev 
<400> 212 

ccgatgttaa ggcccatagc 
<210> 213 



<211> 27 



<212> DNA 



<213> ARTIFICIAL SEQUENCE 



<220> 

<223> MGC14832 

<400> 213 

taaaatgtcc ggccaacatg agttccc 

<210> 214 

<211> 17 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 



<223> MGC14832 for 
<400> 214 
cgcagtgcct ggcacat 
<210> 215 



<211> 20 



<212> DNA 



<213> ARTIFICIAL SEQUENCE 



<220> 



<223> MGC14832 rev 
<400> 215 

gacaccccct gacctatgga 
<210> 216 



<211> 25 
<212> DNA 



BHC03 1 051 



<213> ARTIFICIAL SEQUENCE 



<220> 

<223> LOC51242 

<400> 216 

cagtgacctc tcccgttccc ttgga 

<210> 217 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> LOC51242 for 

<400> 217 

tgggtccctg tgtcctcttc 

<210> 218 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> LOC51242 for 

<400> 218 

agggtcagga gggagaaaac 

<210> 219 

<211> 26 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> FLJ20291 

<400> 219 

ccagtgccca cccgttaaag agtcaa 

<210> 220 

<211> 24 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 



BHC 03 1 051 



<223> FLJ20291 for 

<400> 220 

ttgtgggaca ctcagtaact ttgg 

<210> 221 

<211> 20 

<2X2> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> FLJ20291 rev 

<400> 221 

acaagcactc ccaccgagat 

<210> 222 

<211> 24 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> PR02521 

<400> 222 

agtctgtcct cactgccatc gcca 

<210> 223 

<211> 21 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> PR02521 for 

<400> 223 

aagcctctgg gttttccctt t 

<210> 224 

<211> 20 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> PR02521 rev 
<400> 224 

cccactggtg acaggatggt 
<210> 225 



BHC03 1 051 



<211> 


23 


<212> 


DNA 


<213> 


ARTIFICIAL SEQUENCE 


<220> 




<223> LINK-GEFII 
<400> 225 

catctgacat ctttcccgtg gag 
<210> 226 


<211> 


21 - 


<212> 


DNA 


<213> 


ARTIFICIAL SEQUENCE 


<220> 




<223> LINK-GEFII for 
<400> 226 . 

ctttgcacga tgtctcaacc a 
<210> 227 


<211> 


18 


<212> 


DNA 


<213> 


ARTIFICIAL SEQUENCE 


<220> 




<223> LINK-GEFII rev 
<400> 227 
tttcccgtgg agcaggaa 
<210> 228 


<211> 


26 


<212> 


DNA 


<213> 


ARTIFICIAL SEQUENCE 


<220> 




<223> 


CTEN 



<400> 228 

ccgccgccta atatgcaaca ttaggg 
<210> 229 



<211> 23 



<212> DNA 



<213> ARTIFICIAL SEQUENCE 



BHC03 1 051 



<220> 

<223> CTEN for 

<400> 229 

cgagtattcc aaagctggta teg 

<210> 230 

<211> 24 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 



<220> 

<223> CTEN rev 

<400> 230 

atcacagaga gatggecett atct 

<210> 231 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 



<223> D17S946 forward primer 

<400> 231 

acagtctatc aagcagaaaa atcct 

<210> 232 

<211> 16 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S946 reverse primer 

<400> 232 
tgccgtgcca gagaga 

<210> 233 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> . D17S1181 forward primer 
<400> 233 



BHC03 1 051 



gacaacagag cgagactccc 
<210> 234 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1181 reverse primer 

<400> 234 

gcccagcctg tcacttattc 

<210> 235 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2026 forward primer 

<400> 235 
tggtcattcg acaacgaa 

<210> 236 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2026 reverse primer 

<400> 236 
cagcattgga tgcaatcc 

<210> 237 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S83 8 forward primer 

<400> 237 

ctccagaatc cagaccatga 

<210> 238 

<211> 20 

<212> DNA 



BHC 03 1 051 



137- 



<213> Artificial Sequence 



<220> 

<223> D17S83 8 reverse primer 

<400> 238 

aggacagtgt gtagcccttc 20 

<210> 239 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S250 forward primer 

<400> 239 

ggaagaatca aatagacaat 20 

<210> 240 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S250 reverse primer 

<400> 240 

gctggccata tatatattta aacc 24 

<210> 241 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1818 forward primer 

<400> 241 

cataggtatg ttcagaaatg tga 23 

<210> 242 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 



BHC03 1 051 



• 138 - 

<223> D17S1818 reverse primer 
<400> 242 

tgcctactgg aaaccaga 18 
<210> 243 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S614 forward primer 

<400> 243 

aaggggaagg ggctttcaaa get 23 

<210> 244 

<211> 23 

<212> DNA 



<213> Artificial Sequence 



<220> 

<223> D17S614 reverse primer 
<220> 

<221> misc_feature 

<222> (1)..(1) 

<223> n=a, c, g or t 

<400> 244 

nggaggttgc agtgagccaa gat 23 

<210> 245 

<211> 23 ^ 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2019 forward primer 

<400> 245 

caaaagctta tgatgctcaa acc 23 

<210> 246 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



BHC03 1 051 



<220> 

<223> D17S2019 reverse primer 
<400> 246 

ttgtttccct ttgactttct ga 
<210> 247 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S606 forward primer 

<400> 247 

taggttcacc tctcattttc ttcag 

<210> 248 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S608 reverse primer 
<220> 

<22l> misc_feature 

<222> (17).. (17) 

<223> n=a, c, g or t 

<400> 248 

gtctgggtct ttatggngct tgtg 
<210> 249 ~ 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1655 forward primer 

<400> 249 

cggaccagag tgttccatgg 

<210> 250 

<211> 20 



<212> DNA 



BHC 03 1 051 



<213> Artificial Sequence 



<220> 

<223> D17S1655 reverse primer 
<400> 250 

gcatacagca ccctctacct 
<210> 251 

<211> 25 

<212> . DNA . 

<213> Artificial Sequence 



<220> 

<223> D17S2147 forward primer 

<400> 251 

aggggagaat aaataaaatc tgtgg 

<210> 252 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2147 reverse primer 

<400> 252 

caggagtgag acactctcca tg 

<210> 253 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S754 forward primer 

<400> 253 

tggattcact gactcagcct gc 

<210> 254 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 



BHC03 1 051 



<223> D17S754 reverse primer 

<40O> 254 

gcgtgtctgt ctccatgtgt gc 

<210> 255 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223>' D17S1814 forward primer 

<400> 255 
tccccaatga cggtgatg 

<210> 256 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 



<223> D17S1814 reverse primer 
<400> 256 

ctggaggttg gcttgtggat 
<210> 257 



<211> 18 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> D17S2007 forward primer 
<400> 257 
ggtcccacga atttgctg 
<210> 258 



<211> 20 



<212> DNA 



<213> Artificial Sequence 



<220> 

<223> D17S2007 reverse primer 
<400> 258 

ccacccagaa aaacaggaga 
<210> 259 



BHC03 1 051 



<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1246 forward primer 

<400> 259 

tcgatctcct gaccttgtga 

<210> 260 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1246 reverse primer 

<400> 260 

ttgtcacccc attgcctttc 

<210> 261 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1979 forward primer 

<400> 261 

ccttggatag attcagctcc c 

<210> 262 

<211> 21 

<2i2> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1979 reverse primer 

<400> 262 

cttgtccctt ctcaatcctc c 

<210> 263 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



BHC03 1 051 



<220> 

<223> D17S1984 forward primer 

<400> 263 

ttaagcaagg ttttaattaa gctgc 

<210> 264 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1984 reverse primer 

<400> 264 

gattacagtg ctccctctcc c 

<210> 265 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> G11580 forward primer 

<400> 265 

ggttttaatt aagctgcatg gc 

<210> 266 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> G11580 reverse primer 

<400> 266 

gattacagtg ctccctctcc c 

<210> 267 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1867 forward primer 
<400> 267 



agtttgacac tgaggctttg 
<210> 268 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1867 reverse primer 

<400> 268 

tttagacttg gtaactgccg 

<210> 269 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1788 forward primer 

<400> . 269 

tgcagatgcc taagaacttt tcag 

<210> 270 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1788 reverse primer 

<400> 270 

gccatgatct cccaaagcc 

<210> 271 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 



<223> D17S1836 forward primer 
<400> 271 
tcgaggttat ggtgagcc 
<210> 272 

<211> 24 



<212> DNA 



BHC03 1 051 



<213> Artificial Sequence 



<220> 

<223> D17S1836 reverse primer 
<400> 272 

aaactgtgtg tgtcaaagga tact 
<210> 273 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1787 forward primer 

<400> 273 

gctgatctga agccaatga 

<210> 274 

<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1787 reverse primer 
<400> 274 

tacatgaagg catggtctg 
<210> 275 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S1660 forward primer 

<400> 275 

ctaatataat cctgggcaca tag 

<210> 276 

<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 



BHC03 1 051 



-146- 



<223> D17S1660 reverse primer 

<400> 276 
gctgcggacc agacagat 

<210> 277 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2154 forward primer 
<400> 277 

gataaaaaca agcactggct cc 
<210> 278 



<211> 20 




<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2154 reverse primer 

<400> 278 

cccacggctt tcttgatcta 

<210> 279 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 




<223> D17S1955 forward primer 

<4GG> 279 

tgtaatgtaa gccccatgag g 

<210> 280 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1955 reverse primer 
<400> 280 

cactcaactc aacagtctaa aggtg 
<210> 281 



BHC03 1 051 



<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S2098 forward primer 

<400> 281 

gtgagttcaa gcatagtaat tatcc 

<210> 282 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2098 reverse primer 

<400> 282 

attcagcctc agttcactgc ttc 

<210> 283 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S518 forward primer 

<400> 283 

gatccagtgg agactcagag 

<210> 284 

<211> 20 

<212> DNA 

<213> /Artificial Sequence 



<220> 

<223> D17S518 reverse primer 

<400> 284 

tagtctctgg gacacccaga 

<210> 285 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



BHC03 1 051 



<220> 

<223> D17S518 forward primer 

<400> 285 

attcctgagt gtctaccctg ttgag 

<210> 286 

<211> 17 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S518 reverse primer 

<400> 286 
actgactgcg ccactgc 

<210> 287 

<211> 20 

<212> DNA ' * " 

<213> Artificial Sequence 



<220> 

<223> D11S4358 forward primer 

<400> 287 

tcgagaagga caaaatcacc 

<210> 288 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<223> D11S4358 reverse primer 

<400> 288 

gaacagggtt agtccattcg 

<210> 289 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S964 forward primer 
<400> 289 



BHC03 I 051 



gttctttcct cttgtgggg 
<210> 290 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S964 reverse primer 

<400> 290 

agtcagctga gattgtgcc 

<210> 291 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D19S1091 forward primer 

<400> 291 

caagccaaga catcccagtt 

<210> 292 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D19S1091 reverse primer 

<400> 292 

ccccacacac agctcatatg 

<210> 293 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1179 forward primer 
<400> 293 

ttttctctct cattccattg gg 
<210> 294 

<211> 20 



<212> DNA 



BHC03 1 051 



<213> Artificial Sequence 



<220> 

<223> D17S117 9 reverse primer 

<400> 294 

gcaacagagg gagactccaa 

<210> 295 

<211> 19 

<212> X>NA ... 

<213> Artificial Sequence 



<220> 

<223> D10S2160 forward primer 

<400> 295 

tcccatcccg taagacctc 

<210> 296 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D10S2160 reverse primer 

<400> 296 

tatggagtac ctactctatg ccagg 

<210> 297 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S123 0 forward primer 

<400> 297 

attcaaagct ggatcccttt 

<210> 298 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> D17S1230 reverse primer 
<400> 258 

agctgtgaca aatgcctgta 
<210> 299 

<211> 20 



<212> DNA 



<213> Artificial Sequence 



<220> 



<223> D17S1338 forward primer 
<400> 299 

tcacctgaga ttgggagacc 
<210> 300 



<211> 18 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S133 8 reverse primer 

<400> 300 
aagatggggc aggaatgg 

<210> 301 

<211> 19 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2011 forward primer 

<400> 301 

tcactgtcct ccaagccag 

<210> 302 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2011 reverse primer 
<400> 302 

aaacaccaca ctctcccctg 
<210> 303 
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<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S2011 forward primer 

<400> 303 

ttcttgggct tcccgtagcc 

<210> 304 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D17S2011 reverse primer 

<400> 304 

ggggcagacg acttctcctt 

<210> 305 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S203 8 forward primer 

<400> 305 

ggggatacaa cctttaaagt tec 

<210> 306 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S203 8 reverse primer 

<400> 306 

attcacctaa tgaggattct tcttt 

<210> 307 

<211> 24 

<212> DNA 



<213> Artificial Sequence 



BHC03 1051 



<220> 

<223> D17S2091 forward primer 

<400> 307 

gctgaaatag ccatcttgag ctac 

<210> 308 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S2091 reverse primer 

<400> 308 

tccgcatcct ttttaagagg cac 

<210> 309 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S649 forward primer 

<400> 309 

ctttcactct ttcagctgaa gagg 

<210> 310 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S649 reverse primer 

<400> 310 

tgacgtgcta tttcctgttt tgtct 

<210> 311 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> D17S1190 
<400> 311 



forward primer 
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gtttgttgct atgcctgc 
<210> 312 

<211> 18 

<212> DMA 

<213> Artificial Sequence 



<220> 

<223> D17S1190 reverse primer 

<400> 312 
caacacacta ccccagga 

<210> 313 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> M87506 forward primer 

<400> 313 

actcctcatc tgtagggtct 

<210> 314 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<400> 314 

gagtccgcta cctgagtgct 
<210> 315 

<211> 4617 

<212> DNA 

<213> Homo sapiens 



<400> 315 

gcctggagac taacagctgc 
ggggcggccg gcttctgctg 
tgcaggactt ggtccctctg 
agcaacctcc cccgcccccg 
ggaagggctc cttcaaaatc 
gctccggcgg tggggatggg 
cgagcctgac agacatggga 
agttgacaag aactcaaagt 
aaactgtgtc gcttgtggat 
caactccccc tcctcctccg 
tgcctacatc tgtccttgtg 
cgcctcctcc accccatgcc 



tcggaagagg agctcagcag 
cagcccccag gccctgaatt 
gggcgcctga gtagagggga 
cctcctcccg ggcccctccg 
cgcctcagtc gcctctttcg 
accggcaaga ggccttctgg 
ggctctgcgg gccgggagct 
gccttttctc cggtctcctt 
gtggacattt ctcagcgggg 
agaagaagcc tcagcctcct 
gctccgatgg ggtcttcctt 
ccagatgcat ttccccggat 



cccgggtcgc ggaggaggag 
acctccggtg cccttcccgc 
gcagcagcag cagcagcagc 
gccactcgcg ggtccttctc 
caccaagagc tgcaacggtg 
agagctggct gcttcagctg 
ggacgcgggg aggaaaccca 
cagccccctg ttcacaggtg 
cctgacctct ccacaccctc 
agatgatatc agtgggacgc 
gcagtctttc cccctacctc 
tgctcccatc cgagcagctg 
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aatccctgca 
gcagctttgc 
attgggaaga 
acagttctga 
acactagaat 
accgctgtca 
agtttctcta 
tctatccagt 
tacgacagct 
cttatatccg 
aagcgcagct 
cctgctggtc 
agctaccatg 
agactttggt 

ggggctccac 

tggaagaagt 
tggtatggaa 
gcgaagggtg 
ctggtgccaa 
tgaggtttgc 
ggcagctgtt 
caaaacctag 
gtttaggaac 
aattccttaa 
ccccttgaga 
ccagctgtca 
gctgaattta 
agtgagtgtt 
aatacagctc 
gatttcctta 
ccacttctgg 
ccagacattt 
gatttgagag 
ttaatggttt 
tagagctgct 
atgactcctt 
cctcgtgtca 
gccccagtgt 
gtcccacgtg 
ttacctgact 
tctgatttgt 
gcaagattgt 
taatataact 
tgttgagatc 
caaccaccca 
attgtagaaa 
ctgctgcaca 
agggtagacc 
ggcaggacag 
tggcatgtac 
gtgcggtttt 
tgatggtgag 
gcctcactat 
actggggttt 
gtgacccaac 
gtgtcagtgc 
gcaaaacagc 

gggtggcctt 

tctctgggat 
tgcccttcac 
taaatgtatt 
ggaggctgag 
aaaccctatc 
ccagctactt 
tgagccgaga 
<210> 316 



cagccaaccc 
agccagcctt 
tgcagagatg 
tcctcgttac 
ggagcactac 
atctgttgta 
tttcttaaga 
gtcccgattc 
cgtcaggata 
aaagttctac 
catttccaaa 
accaccaagg 
aaaagaagag 
tccccacgca 
ctcacaccca 
ctcaacactg 
aactcacctt 
agcaggtcca 
aggcagagtt 
aaaggactgg 
gtaggagcag 
ccaattggcc 
ctgagttcag 
gttccatgac 
taactgtaca 
tggcaatcgt 
atggagcatg 
acttccatgt 
ccacagttgt 
ggtcagagga 
gatgggaggt 
ccggatcaga 
ctgcctgtcc 
ctgcagctgg 
ctggggagag 
tgcgtgtgtt 
caagacagat 
cctggtaatt 
gagtgaccag 
gtggaagggt 
catgtggtac 
gatcccagaa 
cgccatcttg 
cttgggtggc 
taattttaat 
acaaaaaagg 
tcaagcagaa 
agatggtctg 
acagtgagat 
tgaacagtgc 
ctagatggaa 
gtggagccca 
ggtcagctca 
cttctggaga 
ctcaccaccc 
actggtccca 
agcgcaagag 
gcacagagca 
acagcctggt 
ctccctgacc 
cctttctagg 
gtgggtggat 
tctactaaaa 
gggaggctga 
ttgcgccact 



ccacagcacc 
cgagagttgg 
aagctgaaag 
atcctgagcc 
agaggaacct 
gagtttatta 
tccagggttc 
agcaatgtca 
gatcacatcc 
tactatgatc 
cagaagcaag 
gcatttggtt 
gaaaagtgag 
gccctggggc 
cccctgggca 
tttctttttc 
gaaggcagtt 
aagaaggggt 
agagtctgtg 
gagttcctga 
gagagaaagg 
ttagctggga 
aaacacctct 
ctgaagttaa 
tgtcactctg 
gtttcccatc 
tttccaggtt 
gcctgtcagc 
gctcttccta 
gagcatcgca 
agcaagggct 
tcctgctggt 
ctgctttcag 
ccatggccaa 
gtttgctatt 
ccatgggact 
actaatgtca 
tataggactg 
tcttcctcct 
ggcccacaag 
ctcttctttc 
attggcttag 
caggaagtgc 
tgatatacag 
attatttttt 
taaaagacgt 
aatcaaggac 
ggaaagttct 
gaaaaacaca 
acagccctgt 
gcttcccagc 
aagcccaaag 
ggccatgtga 
actcatacat 
acctttctcc 
gccctggccg 
gagacagtct 
gttaggggaa 
cagagagtaa 
aggcccgtcc 
tctgggcgcg 
catgaggtcg 
tacaaaaatt 
gacaggagaa 
gccctctagc 



tccagtgtcc 
agaagtgtgg 
ggaaaccaga 
tcagtttccg 
tcagcctgtg 
agagagccat 
caggactgcc 
aatccctcca 
cagatctccc 
ctcaggaaga 

aggtggaacc 

gccaagctcc 
ggaacaggaa 
ttggaagaag 
tcttaggact 
agaagttttg 

ggggtttgtg 

gggaggaggg 
ctgtgggcct 
ggaaggggga 
agggtgggtg 
gaagtagtga 
catggaagct 
ccccgttctt 
atcatggtaa 
acctgggcgg 
cttcatggca 
ttgtgagggg 
gaggaagctc 
tctcacgttt 
tctgtatttt 
ctccactcac 



gaggagcggg 
ttcatatgac 
gagatgtaac 
ctctttctgg 
ggtttgtggc 
cctcatctgg 
tgtagctgaa 
atgagctgtg 
cttggcttcc 
catgtgagtg 
caccccaata 
cctgggatct 
agtgtgtgtg 
aatatgtggc 
tatctaaaga 
gtgcctctga 
tgagcaaaag 
tagagcaaca 
caccaggcag 
gagtcagcaa 
gggaggcaga 
tcaggtacaa 
tcctggggag 
cagcctagtc 
tcgtcaggtg 
gatgaggggg 
atggagcaga 
ctccttcgcc 
gtggctcacg 
ggagttcaag 
agctgggcat 
ttgcttgaac 
ctagacgaca 



cctctaccgg 
ttggtattgg 
tggttctttc 
atcacagggt 
gtgtcatccc 
tatgcactcc 
accaactcct 
gcacctttgc 
actgcctaaa 
ggtatacctg 
ctccacgtag 
agctttgaag 
ggttgggatt 
cacatgaccg 
ggaggggctc 
tttttgatat 
cccgttggat 
aacaggggac 
ggaagatggg 
atctgcctct 
gtctcgaaaa 
ctcctgcatc 
gtactagttg 
cctctgctct 
cagcatccct 
ttcagagcca 
aactgtactc 
gaatgtggag 
tcagaacgca 
ttaggtttat 
cttgtgttca 
tggaaagtct 
gagaaaaact 
attgtgagtt 
agtggagctg 
gttccccatg 
ttcctgatgg 
gagcattgcc 
cagggaggaa 
caccataaac 
atggtagtat 
ttgcctcgtg 
tagagcctga 
ttcttttttt 
tgcctggctt 
ctaagggagc 
cgtttatagt 
ggctttgggt 
caaggaagca 
tggttaaaga 
acctgagtgc 
ggctcctgcc 
gcctccgcac 
aacaaaccaa 
tgtcgtgttg 
ctttctctgt 
tgggtgtttg 
aggcatgggg 
ggaacaggtg 
agtgctgtgc 
cctgtaatcc 
accagcttgg 
ggtggcatgc 
ccaggaggcg 
gagtgagact 



cctgactcga 
gggccaatga 
ctggtacgag 
atcacccacc 
aagtttgagg 
aagaatggaa 
gtccagctgc 
agattccgga 
cctctgatct 
tctctaaagg 
cgaggggctc 
aaccaaatta 
ctctgtgcag 
tactctgcgt 
cttggaaaac 
ttatattact 
tgaaagtggt 
ggccattcag 
aggaggggct 
tgttgccatg 
gaatattggg 
cttttttaag 
tgatttactt 
caacccattg 
attgcttctg 
gtcatgggct 
atgacttagg 
gaaggtgaga 
gccctcacgg 
cactgccatc 
ttctagcaac « 
gccagatgcc 
ccaatggtct 
tgctttctta 
ttgggtcttc 
cttatagttg 
tttgggtggg 
ttcttcctta 
acttgcacca 
acagcccacc 
taccaactaa 
agagtacaag 
agttggaatc 
tgttcctttt 
tgcgctagat 
ttttaggtga 
agataagatc 
tgtagtcaat 
gaaatctgca 
atcctttcca 
cgaggggtta 
cattgccagg 
cccctgtgtt 
ctgaggaggt 
aactgtgtct 
ggagtgggct 
gtgcctgcat 
ctgggccagc 
tggcagcacc 
agagctcatt 
cagcactttg 
ccaaacagtg 
gcctgtagtc 
gaggttgcac 
ccatctc 



78C 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 80 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
. 2520 
2580 
2640 
2700 
2760 
, 2820 
.2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4617 



BHC03 1 051 



156 



<211> 1713 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 



gagcacgtgc gcgtcctcgc tgctcacatt tcggcggagg gcgcgctccc tggaaaattc 
cactcctgtg ctagctccac cctatgcggc ttttctccta cccgacgctc ttcactctca 
gctcccttcc cggcggcctt tgcgggaaca agatggcagc ccccatacct caagggttct 
cttgtttatc gagggttttg ggctggtggt ctcggcagcc agttctggtg actcagtccg 
cagctatagt tccagtaaga actaaaaaac gtttcacacc tcctatttat caacctaaat 
ttaaaacaga aaaggagttt atgcaacatg cccggaaagc aggattggtt attcctccag 
aaaaatcgga ccgttccata catctggcct gtacagctgg tatatttgat gcctatgttc 

ctcctgaggg tgatgcacgc atatcatctc tttcaaagga gggactgata gagagaactg 480 

aacgaatgaa gaagactatg gcatcacaag tgtcaatccg gaggataaaa gactatgatg 540 

ccaactttaa aataaaggac ttccctgaaa aagctaagga tatctttatt gaagctcacc 600 

tttgtctaaa taactcagac catgaccgac ttcatacctt ggtaactgaa cactgttttc 660 

cagacatgac ttgggacatc aaatataaga ccgtccgctg gagctttgtg gaatctttag 720 

agccctctca tgttgttcaa gttcgctgtt caagtatgat gaaccagggc aacgtgtacg 78<M 

gccagatcac cgtacgcatg cacacccggc agactctggc catctatgac cggtttggcc 840 

ggttgatgta tggacaggaa gatgtaccca aggatgtcct ggagtatgtt gtattcgaaa 900 

agcagttgac aaacccctat ggaagctgga gaatgcatac caagatcgtt cccccatggg 960 

caccccctaa gcagcccatc cttaagacgg tgatgatccc tggccctcag ctgaaaccag 102 0 

aagaagaata tgaagaggca caaggagagg cccagaagcc tcagctagcc tgatgacaaa 1080 

aatgacttct agggtgaagc ctgggtgatg aggctgctgg aagctttgaa gtctcccatt 1140 

cccctcatgc tataaaaaga actacctttg ttctctccca tcctgctcag gtcttttcag 1200 

cagtctcatc atcagcaacc atgactgatg actgggccct agcaggtggc aggtataaca 1260 

tggccatgga cactcttctt ttttaaattt tatgtctagc ttctgagtct agatgaaaga 132 0 

cagtatgttt cagagaacat tggatatcag tttttcccac agcagggact gtgagagaca 1380 

accagcagca tcctctttgt aatcacaggg cagggatcag agtttgaaat gaaatgttgt 1440 

cagggtgttg gaaaaatttt ggtgagttct gcacatttcc cctggttcag gctgggcatg 1500 

gaccagcctt cagatggcag aagtggaaga tgagcctact tgtgagcgat gtgactttaa 1560 

ggaaatgaag actggggaag aataattagt gtttataaga catttaagag gccctttttc 1620 

atatactgac tcactgatga atcagcattt gcattttatg gaaaaatata aatccaaaga 1680 
aataatttat cccttaaaaa aaaaaaaaaa aaa 
<210> 317 

<211> 5632 

<212> DNA ^ 
<213> Homo sapiens 



<400> 317 

gtggcgatca tggcaagtta gaagttttct gactcctttc ggaggagcct ccgggacccc 60 

ggggagtaac aggtgtctgg aggctgaagg gtggaggggt tcctggattt gggttttgct 120 

tgtgaaactc ccctccaccc tcctctctcg cacccaccca ccccctcacc cccttctttt 180 

tccgtccttg gaaaatggtg tccaagctca cgtcgctcca gcaagaactc ctgagcgccc 240 

tgctgagctc cggggtcacc aaggaggtgc tggttcaggc cttggaggag ttgctgccat 300 

ccccgaactt cggggtgaag ctggagacgc tgcccctgtc ccctggcagc ggggccgagc 360 

ccgacaccaa gccggtcttc catactctca ccaacggcca cgccaagggc cgcttgtccg 420 

gcgacgaggg ctccgaggac ggcgacgact atgacacacc tcccatcctc aaggagctgc 480 

aggcgctcaa caccgaggag gcggcggagc agcgggcgga ggtggaccgg atgctcagtg 540 

aggacccttg gagggctgct aaaatgatca agggttacat gcagcaacac aacatccccc 600 

agagggaggt ggtcgatgtc accggcctga accagtcgca cctctcccag catctcaaca 660 

agggcacccc tatgaagacc cagaagcgtg ccgctctgta cacctggtac gtcagaaagc 720 

aacgagagat cctccgacaa ttcaaccaga cagtccagag ttctggaaat atgacagaca 780 

aaagcagtca ggatcagctg ctgtttctct ttccagagtt cagtcaacag agccatgggc 840 

ctgggcagtc cgatgatgcc tgctctgagc ccaccaacaa gaagatgcgc cgcaaccggt 900 

tcaaatgggg gcccgcgtcc cagcaaatct tgtaccaggc ctacgatcgg caaaagaacc 960 
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ccagcaagga 
gaggggtgtc 
tctacaactg 
acgcctatag 
accaccagcc 
gaaacaatga 
ccagccagtc 
tctcacctga 
tgacgaatat 
cacccctctc 
tccctgtcat 
agcagctgca 
cccagcagcc 
agcaggaacc 
ataccagcag 
cctggtgatg 
ccatcaccct 
ggtccctgct 
cagcccgaag 
aaggactcct 
gccgtactgt 
tactgaaatg 
ccacagactc 
aatttgtcca 
tcctctcccc 
cagagtcaaa 
Sggaagtgaa 
ttggactgac 
gaggggctga 
ccgtgcactc 
agttgcagtc 
cgatcatggc 
agtaacaggt 
aaactcccct 
tccttggaaa 
gagctccggg 
gaacttcggg 
caccaagccg 
cgagggctcc 
gctcaacacc 
cccttggagg 
9gaggtggtc 
cacccctatg 
agagatcctc 
cagtcaggat 
gcagtccgat 
atgggggccc 
caaggaagag 
ggtgtccccc 
caactggttt 
ctatagctcc 
ccagcccagc 
caatgagatc 
ccagtcggtt 
acctgatggt 
gaatatccac 
cctctctgga 
tgtcatcaac 
gctgcacagc 
gcagcccttc 
ggaacccccc 
cagcagcatc 
gtgatgccca 
caccctctgg 
cctgcttacc 
ccgaagccca 



agagagagag 
cccctccaaa 
gtttgcaaac 
ctccaaccag 
cagctcctct 
gatcacttcc 
ggttttacag 
tggtaaaatg 
ccacagcctc 
tggagtcatg 
caacagtgtg 
cagccctcac 
cttcatggca 
cccccagtat 
catcagtaca 
cccacacacc 
ctgggcagct 
tacctgacgg 
cccagtttcc 
gtctgtcctg 
ctatgttgtg 
aggagagaag 
cctcccctgc 
atggctgtaa 
tctgccctgt 
atttcaaaag 
gccccttgtc 
taccagctaa 
atgtagtgtg 
cctaagaaac 
tcagaaagcc 
aagttagaag 
gtctggaggc 
ccaccctcct 
atggtgtcca 
gtcaccaagg 
gtgaagctgg 
gtcttccata 
gaggacggcg 
gaggaggcgg 
gctgctaaaa 
gatgtcaccg 
aagacccaga 
cgacaattca 
cagctgctgt 
gatgcctgct 
gcgtcccagc 
agagaggcct 
tccaaagccc 
gcaaaccgca 
aaccagactc 
tcctctcctc 
acttcctcct 
ttacagcaag 
aaaatgatct 
agcctctccc 
gtcatggcaa 
agtgtggccg 
cctcaccagc 
atggcagctg 
cagtattccc 
agtacactca 
cacaccactt 
gcagctgtca 
tgacggacgt 
gtttcccttc 



gccttagtgg 
gcccacggcc 
cgcaggaagg 
actcacagcc 
cctccaaaca 
tcctcaacaa 
caagtctccc 
atctcagtct 
tcccaccata 
gcaattgcac 
gccggcagcc 
cagcagcccc 
gctgtgactc 
tcccacacct 
ctcaccaaca 
acttacttcg 
gtcatggaaa 
acgtcctgct 
cttctatgca 
gaggtgggag 
atccttcatc 
gacatgtgct 
ccccatccca 
actataaact 
atataatact 
tgatctgtcc 
ctctcccgcg 
ctccagtctc 
tcagagtaac 
cccagggcag 
ttccattaaa 
ttttctgact 
tgaagggtgg 
ctctcgcacc 
agctcacgtc 
aggtgctggt 
agacgctgcc 
ctctcaccaa 
acgactatga 
cggagcagcg 
tgatcaaggg 
gcctgaacca 
agcgtgccgc 
accagacagt 
ttctctttcc 
ctgagcccac 
aaatcttgta 
tagtggagga 
acggcctggg 
ggaaggagga 
acagcctgaa 
caaacaagct 
caacaatcag 
tctccccagc 
cagtctcagg 
accataatcc 
ttgcacaaag 
gcagcctggc 
agcccctcat 
tgactcagct 
acacctcccg 
ccaacatgtc 
acttcgtgcg 
tggaaaagcc 
cctgctggca 
tatgcagtat 



aggaatgcaa 
tgggctccaa 
aggaggcatt 
tgaaccctct 
agctgtcagg 
tcagtcacca 
cagccagcct 
caggaggagg 
atccccagca 
aaagcctcaa 
tggcagccct 
tcatgcagca 
agctgcagaa 
cccggtttcc 
tgtcttcaag 
tgcgcaacaa 
agcccagtga 
ggcacctcag 
gtattgccac 
acaaggaacc 
gaacaaactg 
attgaactga 
aatgatcttg 
actgtaatta 
aaagtgtcta 
cctctcccct 
aggcctggac 
ctgacattaa 
atgccagctt 
ggaaactggc 
acaatttatt 
cctttcggag 
aggggttcct 
cacccacccc 
gctccagcaa 
tcaggccttg 
cctgtcccct 
cggccacgcc 
cacacctccc 
ggcggaggtg 
ttacatgcag 
gtcgcacctc 
tctgtacacc 
ccagagttct 
agagttcagt 
caacaagaag 
ccaggcctac 
atgcaacagg 
ctccaacttg 
ggcattccgg 
ccctctgctc 
gtcaggagtg 
tcaccatggc 
cagcctggac 
aggaggtttg 
ccagcaatct 
cctcaacacc 
agccctgcag 
gcagcagagc 
gcagaactca 
gtttccatct 
ttcaagtaaa 
caacaacaag 
cagtgacctg 
cctcagacaa 
tgccacaatg 



cagggcagaa 
cttggtcact 
ccggcaaaag 
gctctcccac 
agtgcgctac 
tggcaacagc 
ggacccaggc 
tttgccccca 
atctcaaaac 
cacctcccaa 
gcagcccgtc 
gagcccaggc 
ctcacacatg 
atctgcaatg 
taaacagtgt 
caaggaccct 
cctgaccagc 
acaatccact 
aatgcctctc 
tccgaagagg 
atgcgaaaac 
gccaaacaca 
agatttcttt 
agtgcaattt 
ttagttttct 
catggagaaa 
acttatgggg 
gacacacctc 
cctgtgggcc 
tgtttgatag 
ttatcactaa 
gagcctccgg 
ggatttgggt 
ctcaccccct 
gaactcctga 
gaggagttgc 
ggcagcgggg 
aa gggccgct 
atcctcaagg 
gaccggatgc 
caacacaaca 
tcccagcatc 
tggtacgtca 
ggaaatatga 
caacagagcc 
atgcgccgca 
gatcggcaaa 
gcagaatgtt 
gtcactgagg 
caaaagctgg 
tcccacggct 
cgctacagcc 
aacagcgcca 
ccaggccaca 
cccccagtca 
caaaacctca 
tcccaagcac 
cccgtccagt 
ccaggcagcc 
cacatgtacg 
gcaatggtgg 
cagtgtcctc 
gaccctgttt 
accagcacct 
tccactctca 
cctctcccac 



tgtttgcagc 
gaggtccgtg 
ctggccatgg 
ggctcccccc 
agccagcagg 
gccatggtga 
cacaatctcc 
gtcagcacct 
ctcatcatga 
gcacagagtg 
cagttctccc 
agccacatgg 
tacgcacaca 
gtggtcacag 
cctctacaag 
gttttccaca 
acctgcgaga 
ctcaggagcg 
ccacgatgtc 
aagcaagaaa 
ttgaatctgt 
ctgtaaatat 
taaagaagta 
cccctctgtg 
ttgtaaaggt 
catcctaagt 
acagcatacc 
tggatccctg 
aggagctcag 
cagaagaaaa 
aaaaaagtgg 
gaccccgggg 
tttgcttgtg 
tctttttccg 
gcgccctgct . 
tgccatcccc 
ccgagcccga 
tgtccggcga 
agctgcaggc 
tcagtgagga 
tcccccagag 
tcaacaaggg 
gaaagcaacg 
cagacaaaag 

at gggcctgg 

accggttcaa 
agaaccccag 
tgcagcgagg 
tccgtgtcta 
ccatggacgc 
ccccccacca 
agcagggaaa 
tggtgaccag 
atctcctctc 
gcaccttgac 
tcatgacacc 
agagtgtccc 
tctcccagca 
acatggccca 
cacacaagca 
tcacagatac 
tacaagcctg 
tccacaccat 
gcgagaggtc 
ggagcgcagc 
gatgtcaagg 



102C 
108C 
114C 
120C 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
, 2520 
. 2580 
; 2640 
2700 
2760 
2820 
.! 2880 
2940 
. 3000 
■ 3060 
, 3120 
; 3180 
' 3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
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actcctgtct 

tactgtctat 

gaaatgagga 

agactccctc 

tgtccaatgg 

ctcccctctg 

gtcaaaattt 

agtgaagccc 

actgactacc 

ggctgaatgt 

gcactcccta 

gcagtctcag 

<210> 318 



gtcctggagg 
gttgtgatcc 
gagaaggaca 
ccctgccccc 
ctgtaaacta 
ccctgtatat 
caaaagtgat 
cttgtcctct 
agctaactcc 
agtgtgtcag 
agaaacccca 
aaagccttcc 



tgggagacaa 
ttcatcgaac 
tgtgctattg 
atcccaaatg 
taaactactg 
aatactaaag 
ctgtcccctc 
cccgcgaggc 
agtctcctga 
agtaacatgc 
gggcagggaa 
attaaaacaa 



- 158- 

ggaacctccg 
aaactgatgc 
aactgagcca 
atcttgagat 
taattaagtg 
tgtctattag 
tcccctcatg 
ctggacactt 
cattaagaca 
cagcttcctg 
actggctgtt 
tttattttat 



aagaggaagc 
gaaaacttga 
aacacactgt 
ttcttttaaa 
caatttcccc 
ttttctttgt 
gagaaacatc 
atggggacag 
cacctctgga 
tgggccagga 
tgatagcaga 
cactaaaaaa 



aagaaagccg 
atctgttact 
aaatatccac 
gaagtaaatt 
tctgtgtcct 
aaaggtcaga 
ctaagtggga 
cataccttgg 
tccctggagg 
gctcagccgt 
agaaaaagtt 
aa 



4980 
5040 
5X00 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5632 



<211> 3123 

<212> DNA 

<213> Homo sapiens 



<400> 318 

gaactgtggc 

gggggccaaa 

aaggaaatat 

gaagtctgtc 

gaaaaaagaa 

gatgacttca 

gtgattcacc 

agccagaaat 

caaagattga 

atctcagaaa 

ccgatcttcc 

ctcgactact 

aagccatccc 

gaaaaacatt 

gcttcagagc 

taataaaaat 

cagccaagaa 

atcgactaat 

ggcttgtagt 

agctggcttc 

caacttttgc 

tgtccattgg 

gatctgagac 

ctgttcttgt 
- 1 -» »- 

cagtccacag 
gagggattga 
tggaatatat 
catttttcac 
ctgggtgtcc 
aaaagatgat 
tagaaaaagc 
ctcttgaaga 
tgatcccagc 
aacatttgaa 
tcagtttcaa 
gtccaggctg 
actgagatta 
acaatggaaa 
agaaaaacaa 
tcccttcaag 
tcaaaataaa 
gagctgtcta 



gctttctggg 
ttcgacacga 
gactttgatt 
ccaggtgtgt 
gagagcctaa 
gagacagggt 
caccttgggc 
tgcttcccaa 
agacaaaaaa 
agaaaagata 
tgacccaatt 
tcagaacatt 
agttatgctg 
agcttttagc 
cctgattata 
ttctgaggga 
atttggacct 
ctatttatta 
agacgaatca 
cattttcctg 
atatgatgtt 
agcaaggaat 
cggaaaactt 
ttttgttcag 

tttcagagca 
ttttaaaggt 
ccacaggata 
tgaggatgat 
tgtaccagaa 
taagaaacca 
taaggataaa 
caaaagttaa 
atgaatgtta 
atcaactaca 
tttgtaggtg 
gtcttgaact 
caggcacaag 
taacattcat 
atgtttatgc 
tgtgaaggaa 
gccttgacat 
ccaagaagaa 



taaagatgga 
gacgcttctc 
cttcggaggt 
gtggagcatc 
ctgaaaggaa 
ttcaccatgt 
tcccaaagtg 
gaagaaggtg 
gttcagagag 
aacttcttgc 
gctacatttc 
ctagatgcag 
catggtcggg 
attcctattt 



cgtccacgat 
ggcagacgca 
gcttcaggga 
acaaacacat 



tcaccaacac 
acaggattca 
aaatcatcta 
aagcaagatc 
gataaactgt 
gcctgcacat 
gaacagtggt 
tctgcagtag 
ctggccgtga 
tccattgaaa 
gttattcatg 
ggaaaaatct 
gtgaacttgg 
ggtcgaactg 
aagccattat 
tacataaaag 
ttggaaaggg 
cagagaaagg 
aaacagactt 
ttttcatgga 
agtacatggg 
cctttttttt 
cctgacctca 
ctgctgcacc 
tgacatttct 
taaaaataac 

ggtgtgatga 

taaatacccc 
acctgctgct 



gagggagcag 
tggccagtat 
ctgggattac 
ctactataca 
aaagtaaact 
ggaataaaca 
agcaacttga 
gcttccaaat 
aacttctggc 
taatgcagct 
gagaacttgc 
gaatacacat 
aaaagtttga 
cccccggaat 
ttgaagatgg 
cccacaaggt 
gcaaactcaa 
aaactgtaga 
gagaacttgt 
gggctaaaga 
cagagagaac 
gggttctgat 
tgatcaacta 
gaagagcagg 
taagaagcgt 
gttttcagaa 
agagcattag 
tcactggtca 
taaaaatact 
atacttgaag 
actggtgata 
ttcctgtaga 
cacaatcctc 
cagctctgta 
gtggtttgaa 
accaaaatgt 
atgctgtgga 
ttccactgct 
ctcttaattt 



ctctttcgcc 
gctcgattcc 
ctggactttt 
cagaagcccc 
agcaagaaaa 
ggtctcgatc 
agatgtgagc 
gtggatgtca 
aacttccgga 
caaaattcac 
ccaggaatat 
gcctacgcca 
ttctgctcca 
gaaacaaccc 
cagccagatt 
gatccacaaa 
tattcttgtg 
cgacctagca 
caaaactggg 
ccgaagagct 
cctggacaat 
acaagagctt 
taaaaagggt 
actttttcat 



acaacaacag 
ttgtacagcc 
tgactttcca 
gaataaggga 
tgctaatgtt 
actactaagc 
tacaactcca 



gaacagcaag 
gtcccagaaa 
tcttacagtc 
aatgatccta 
gatgagggtc 
ctgccttagc 
ggtgactttt 
tccagggaga 
ggtgaactct 
gaggcatctg 
cactttgtgg 
taatatttcc 



ggctcggcgc 60 

agataggaaa 120 

ttggaaacaa 180 

aaaatggaga 240 

aaaggaagac 300 

tcctgacctc 360 

caccacgccc 420 

tctgtagaag 480 

aagttggaga 540 

gtccaaggaa 600 

aaaatcaatt 660 

atccaaatgc 720 

actggatctg 780 

gcaaataaag 840 

cacagagagt . 900 

gcagcagtgg 960 

actactccaa 1020 

agtgttgagt 1080 

ttcagagacc 1140 

atgttcagtg 1200 

gtcatcagtg 126^ 

ctctttgttg 132^| 

ttcaatccac 1380 

gagctcatat 1440 

agagataaca 1500 

ttgctagcaa 1560 

actagctcag 1620 

aaagcaatta 1680 

atacagcagg 1740 

aaacaaaaga 1800 

aaatgtttct 1860 

aagaaagtag 1920 

tgtaatttta 1980 

acctgtacca 2040 

aactatcaag 2100 

ttgccatgtt 2160 

ctcctgagta 2220 

aaatgattat 2280 

tacttcttat 2340 

taaggacttt 24 00 

gaacagaaat 2460 

atggtagcat 2520 

taatttgttg 2580 
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SSSSSi ssss- SS3SSS SS3»* 

S! ~ ?w si H™1 ilSl 

si If « iss ss ieI 

S23S ESSSS 252s »~ ~| 



<211> 1817 

<212> DNA 

<213> Homo sapiens 



<400> 319 
caaccatcct 
agcttccaca 
cagcgcctcg 
tccaactcca 
gttggcaatg 
aaggtgcgga 
gaaaccaacg 
ctgcgaagtc 
aatgctaaac 
ctaacagtgg 
aaaacagatt 
gagcatcagg 
gttgatgctg 
gaagtcatgg 
gttctgcagc 
acggagctga 
aaagagtctt 
aacctccagt 
gaacgccaga 
attgctactt 
agcaccctgg 
gtagtggatg 
gctaccagaa 
ctccctgcaa 
gtggggtggc 
aggagctgca 
attactgcca 
tttgtaggta 
agaggaatcc 
aaaaaaaaaa 
aaaaaaaaaa 
<210> 320 



gaagctacag 
gaagcctgag 
ggacgacacc 
gacacacggt 
agaaaatggc 
ccctggagca 
ccccgagggc 
agattaagga 
tggctgctga 
aagctgatct 
tggagattca 
aggaagtcga 
ctccaggcct 
cccagaagaa 
aacaggtcac 
gacgcacctc 
tggagcacac 
cgctgttgag 
acaacgaata 
accgccgcct 
aagagagaga 
gcaaggtcgt 
ggagatgctg 
gaaatcagcc 
gggaatccta 
aattgcctgc 
tctttatcat 
atgttcctga 
atgtaaattg 
aaaaaaaaaa 
aaaaaaa 



gtgctccctc 
ctcctccttg 
cagcgtttat 
gaactatggg 
catgcagaac 
gtccaactcc 
tggtcgcgac 
tgctcaactg 
ggacttcaga 
ccaaggcctg 
aattgaagaa 
tggcctacac 
gaaccttggc 
ccttcaagag 
agtgaatact 
ccagagcctt 
tctagaggag 
ctctctggag 
ccatatcctt 
tctggaagga 
tataaagaaa 
gtcatctgaa 
ctgaggtttt 
ataagaaagc 
tttatcagac 
aaaaatgaaa 
gaagcacatc 
aaattgcaat 
caaataaacc 
aaaaaaaaaa 



ctggaatctc 
caggcccctg 

gggggtgctg 

agcgatctca 
ctaaatgacc 
aaacttgaag 
tacagtgcat 
caaaatgctc 
ctgaagtatg 
aataaggtct 
ctgaataaag 
aagcatctgg 
gtcatcatga 
gccaaagaac 
gaagaattaa 
gagatagaac 
accaaggccc 
gcccaactga 
cttgacataa 
gaagacgtaa 
accaggaaga 
gtcaaagagg 
gaaagaaatt 
actattaata 
tctgtaattg 
tccagtgagc 
aattacaagc 
acatttcaat 
actttctaat 
aaaaaaaaaa 



caatggattt 
tagtcagtac 
gaggccgggg 
caggcggcgg 
gtctagcgag 
tgcaaatcaa 
attacagaca 
ggtgtgtcct 
agactgagag 
ttgatgacct 
acctagctct 
gcaacactgt 
atgaaatgag 
agtttgagag 
aaggaactga 
tccagtccca 
gttacagcag 
tgcagattcg 
agactcgact 
aaactacaga 
ttaagacagt 
tggaagaaaa 
tggctataat 
ctctgcagtg 
aatataaatg 
actagaatat 
tgtagaccac 
tatactaaac 
tttttcctgt 
aaaaaaaaaa 



cagtcgcaga 
agtgggcatg 
catccgcatc 
ggacctgttt 
ctacctagaa 
gcagtggtac 
aattgaagag 
gcaaattgat 
aggaatacgt 
aaccctacat 
cctcaaaaag 
caatgtggag 
gcagaagtat 
acagactgca 
ggttcaacta 
tctcagcatg 
ccagttagcc 
gagtaacatg 
tgaacaggaa 
atatcagtta 
cgtgcaagaa 
tatctaaata 
cttatctttg 
attagaaggg 
ttttactcag 
ttaaaacatc 
ctaatatcaa 
ctcacaaagt 
ttctgaaaaa 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 . 
600 
660 
. 720 
780 
840 
900 
: 960 
1020 
-1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1817 



<211> 1474 

<212> DNA 

<213> Homo sapiens 



<400> 320 

ggcagatgaa atataagatt catcaaccac 
tccatcgtcc ctctgcaggt cacagacaca 
gtccggggct gctaacaacg gctacattcc 



atttgacagc ccatggcagg tttcctgttt 60 
cagagcccag ccgtggcagg ctcagccggg 120 
tcccccaggg ccaagggaaa tcctgagcgc 180 
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aaaccaoqqt tgtttggttt tgaggtgtgc tgggatgaaa ggcaccctgg aagtggaagg 
tllggtcKt cattaa?taa tlaStltat aattgagggt ttgttcttaa |agcgagtcc 
tttgaaagta ctttccttca aacagtgact gccacaaagg catcagatat t^ccacctt 
ctcggctgcc tcagcacagc aagctttatt ctgggacctg agatcctgtt ctgagctggc 
«t?lcSc? ccaggctcgc tclccctccc . tttagagata gtggatggta agatgaccaa 
tgc?cagatt attlttctca ttgacaatgc caggatggca gtggatgact tcaacctcaa 
glaatggaga agcatcatgt gccaagtgac ttcaatgtca atgtgaaggt 99^acaggt 
Iccagggaag atctgattaa ggtcctggag gatatgagac aagaatatga gcttataata 
aagaJgaagl atcgagactt ggacacttgg tataaagaac agtctgcagc catgtcccag 
gaggcagcla gtccagccac tgtgcagagc agacaaggtg acatccacga actgaagcgc 
Icfltclagg ccctggagat tgacctgcag acacagtaca gcacgaaatc tgctttggaa 
aacatqttat ccgagaccca gtctcggtac tcctgcaagc tccaggacat gcaagagatc 
Itc^cccact atgaggagga actgacgcag ctacgccatg aactggagcg gcagaacaat 
aaataccaag KSglSgg catlaalacc cacctggaga aggaaatcac cacgtaccga 
?ggc?cc?gg agggagaglg tgaagggaca cgggaagaat caaagtcgag cajgaaagtg 
t??gcaac?I cllagatcaa ggccataacc caggagacca tcaacggaag attagttctt 
tgtcaagtga atgaaatcca aaagcacgoa tgagaccaat gaaagtttcc fcctgttgta 
aaatctattt tcccccaagg aaagtccttg cacagacacc agtgagtgag ttctaaaaga 
tacclSgga Statcagll tcagaaactt ttattttttt tttctgtaac agtctcacca 
aact?"21? aatgctclta ataEattgca cttttctaat caaagtgcga gtttatgagg 
g?aaagct!£ actltcctac tgcagccltc agattctpat cattttgcat ctattttgta 
gccaataaaa ctccgcacta gcaaaaaaaa aaaa A 
<210> 321 W 



<211> 754 
<212> DNA 
<213> Homo sapiens 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



caa^caaacc cataaatacc acagactcta atagccatgg attgctgtgc ctctcgaggc 
tacagtgtcc ccaccgggcc tgccaccacc atctgctcct ctgacaaatc ctgccgctgt 
gag?c?gcc tgcccllcac clgcccacac acagtttggt tactggagcc cacctgctgt 
gSLctitc clccaccctg ccacattcct cagccctgcg tgcccacctg cttcctgctc 
iactcctgcc agccaactcc aggcctggag accctcaacc tcaccacctt * a ^agccc 
tQctgtgagc cctgcctccc aagaggctgc taatggatgg ctactttgct cagtgcctga 
gattSlaaa gtclacatag aalc?ttagc attcacctat ctcagtacct acaactaatg 
?act?tgctt lagaaattgg aacaaggatg gtactaccac aatcaccccc gcaaaaaaa 
aagagaccaa gaaactttca atgaccattc agctataacc aactgcagtt fc 9 a ^cagtg 
aatocctata gcttcctgaa gctgttcgat tccttcatat taaagtgtct ctttctgtgg 
ItggStggg Iattctg?tt tcaltcttgg gtggtatctt tctgaaaatt -aggaagttc 660 
ttStgatta tcctaataaa ttttacatct ctggcatagc acaaaaaaaa aaaaaaaaaa » 

ASkaa. 



aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 
<210> 322 

<211> 749 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



aagaaac^ga aagctaacca gacgcccatt gccatggatt gctgtgcctc tcgcagctgc 
aqtgtcccca ctgggcctgc caccaccatc tgctcctccg acaaatcctg ccgctgtgga 
g?c?5cctgc ccag?acc?g cccacacaca gtttggttac tggagcccat ctgctgtgac 
lactltcclc caccctgcca cattcctcag ccctgcgtgc ccacctgctt cctgctcaac 
tcctgccagc caactccggg cctggagacc ctcaacctca ccaccttcac tcagccctgc 
taSJaccct gcctccclai aggctgctga tggatggcta ctttgctcag tgcccgacaa 360 
caSSatcc Sgaagctgtc ccttcagtat tcacttgcct cagtagtttg ccagatgtta 420 
agg?!Scca gS?gacc?ag atatgaagaa cttacctttg gttttaatgg gggaaaaaaa 480 
gaaaagtatt ttttatggtt atttagctga aaaaccattt ggttcctgtg ggcaggtgaa 



60 
120 
180 
240 
300 



540 
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tgagttttat tagcaaaata ctgtttcaat 
catattgctt cctggctctt gtttcttgta 
gtgggttttc caataaacta tatttctctg 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 
<210> 323 

<211> 440 

<212> DNA 

<213> Homo sapiens 



-161- 

ctttaagacc tcagattaca tgttcttgat 
ctgggtattt tcatagaaga aaatttcttg 
gcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



<400> 323 

gtgttccctg agataggtgg atataaaaga 
cttcctttgc cacaaaacat tgtcctggtc 
tctccaagga tgctgcagcg tccccactgg 
aagctgtcaa tgtgaagtct gcctacccag 
gcccacctgc tgtgaacctg gcccctgcct 
tgttgactgc tcaacaaatg ccacccagct 
tgcatccaga gtgtgaacca ccttgctgct 
gaccttcagt agtcattaag 
<210> 324 

<211> 614 

<212> DNA 

<213> Homo sapiens 



cccatagagg acaaccttgt agaagaaagc 
cactgtcact atgtcttgct gtgattccta 
cctggccacc actatctgcc cctctgacat 
cacctgtcct catgagatca gcctccttca 
ggctgcatgc ctgactccta tgtgccatcc 
ccaaccctga gcgggctctc tgtcaccacc 
agccaaagag cttgcccaca ttaccctgag 



<400> 324 

agacttctct caactcaaca aaaacccacc 
ctcctgcagc gtccccaccg gccctgccac 
ctgtggagtc tgcctaccca gcacctgccc 
ctgtgacacc tgccccccac cctgctgcaa 
gctcaacaac tgtcacccga ctcccggact 
gcctggctgt gagagtccct gtgagccccg 
gtgaggtggc tgcccaatgt cctctgcacc 
atcaaggcta aggccatccc aatccccggg 
tgcacatttg ggtaccattg gagacctccc 
tgagaaataa ccattttgac catttgttaa 
aaaaaaaaaa aaaa 
<210> 325 

<211> 1193 

<212> DNA 

<213> Homo sapiens 



tcccattgcc atgtattgct gtgctctccg 
caccttctgc tcatttgata aaagctgccg 
acatgagatc agcctccttc agcccatctg 
gcctgatacc tatgtgccaa cttgctggct 
gagtgggatc aacctgacca cctatgttca 
ctgttaacca gccgagtctg cacaggttcc 
atctgggctt cagcactcac tactgcctac 
gccaagtctt gatgaatctt cttaattatt 
ttctgtcttt taggctattt catcactctt 
taaactttat tctggcttag caaaaaaaaa 



<400> 325 

cagaaactcc tccaagcaac 
agaccagctt ctgtggatat 
gccagccaag ctgctgtgag 
gcggatttcc cagcttctca 
gctgtgagac cagctgctgc 
gctgccagat cagctcctgc 
aggagggcag cagtggagct 
tggagggcac ctacctaccc 
aactgcacca tgcccaggcc 
gcccagtctg ctgctgtgag 



ctaactctta acccaacttc 
cccagcttct ccatcagtgg 
accagctgct gccagccacg 
accagtggga cctgcagctc 
cagccaagct gctgtgagac 
ggaactggct gtggcattgg 
gtgagcaccc gtatcaggtg 
ccctgctgtg tggtgagctg 
tcctgctgcc gcccgtccta 
cccacttgct gaaagccagt 



tgacaccatg acctgctgcc 
gacctgtggc tccagctgct 
cagctgccag actagcttct 
cagttgctgc cagccaagct 
cagctgctgc cagccaagct 
tggtggcatc agctatggcc 
gtgccgccca gacagtcgtg 
cacgccccca tcctgctgcc 
ctgtggacag tcctgctgcc 
ttgcttattt tcaattgcct 
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aggtcacagt gtctctgaac tgttcatccc ttgaccacct ctggaccact aacaagttct 660 

cagactttgc attgcttgtg atggagacta ctaagtatat gagctcacaa ttctatctga 720 

ttccattcta caatgaatac cttgaccctt cactggggac acagaaatgc tacaaagcca 780 

cctgctgatc atcaatttgc ttgggatata ctatttctga tatttctgca ggattaaaaa 840 

ttactgacat gttgtggaat ttatccatga gaactatcca caagtctaat gtttccatgc 900 

tttataatct attttatctt gtttacctaa aattttttgc aacatcaaag acaccaaatt 960 

atagccaagt gacattcctc aagtcaccag agagaatgga agctcatcac ccaacattca 1020 

gcttctaaga agtaggctgg actttccaca ttttaacatc tgatccatcc cttggttttt 1080 

ggatcataafc gatcttgcct gctggatatt tcagttatat ctgtgataca atgtcttctg 1140 

tcatttctta ataaatatta tatactaggc aaagaaaaaa aaaaaaaaaa aaa 1193 
<210> 326 

<211> 986 

<212> DNA 

<213> Homo sapiens 



<400> 326 
aagcaaccca 
tggatatccc 
ctgtgagacc 
cttctcaact 
ctgctgccag 
tggcattggc 
ccgcccagac 
acccccaacc 
tggacagtcc 
gccagtttgc 
tttcttcaag 
ggggttacca 
tactcttctc 
taccttgaaa 
agaccatgga 
gcaaattgca 
tggtgcaaca 
<210> 327 



gacttcatac 
agctgctcca 
agctgctgcc 
agtgggacct 
ccaagctgct 
tatggccagg 
tgccgtgtgg 
tgctgccagc 
tgctgccgcc 
tgattttcaa 
cacttatgga 
aatattttgg 
tctgaggacg 
ctgagccttt 
agagccatct 
ccccctatga 
aaaaaaaaaa 



cagctcccaa 
ccagtgggac 
agccaagctg 
gcagctccag 
gccagaccag 
agggcagcag 
agggtacctg 
tgcaccacgc 
cagtctgctg 
cttgaaattt 
caacgaacaa 
cctcagaatt 
ccaaaataca 
gcaagcattg 
gtccttctca 
aagaggaata 
aaaaaa 



caccatgacc 
atgcggctcc 
ctgccagacc 
ttgctgccag 
ctcctgcgga 
tggagctgtg 
cctgcccccc 
cgaggcctcc 
ctgctactcc 
ccactttcag 
attcttcaac 
atctgattcc 
aatttgaccc 
aagcccacgc 
ggacactcac 
atataccaag 



tgctgccaga 
agctgctgcc 
agcttctgcg 
ccaagctgct 
actggctgtg 
agcacccgta 
tgctgtgtgg 
tgctgccgcc 
tgtgagccca 
ttccattcat 
ctttctttgt 
tttcaattcc 
aagaaatgaa 
tctgagtctc 
ttcctgtatc 
gtctaataaa 



ccagcttctg 
agccaagctg 
gatttcctag 
gtgagaccag 
gcattggtgg 
tcaggtggtg. 
tgagctgcac 
catcctactg 
cctgctaaaa 
gaacgaatta 
ctttcttatg 
agaaagacct 
aaagccgatt 
agcgccgacg 
ccaccgtcct 
ttttaactat 



60 
12 Of 

180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
966 



<211> 903 
<212> DNA 
<213> Homo sapiens 



<400> 327 

aataggcagc cataattcag aaactcctcc aagcaaccca accttcagat caactcctga 60 

caccatggcc tgctgtcaga ccagcttctg tggatttccc agctgctcca ccagtgggac 120 

ctgcggctcc agctgctgcc agccaagctg ctgtgagacc agctcctgcc agccacgctg 180 

ctgtgagacc agctgctgcc agccaagctg ctgccagacc agcttctgtg gatttcctag 240 

cttctcaacc ggtgggactt gtgactctag ctgctgccag ccaagctgct gtgaaactag 300 

ctgctgccag ccaagctgct accagaccag ctcctgcgga actggctgtg gcattggtgg 360 

tggcattggc tatggccagg agggcagcag tggagctgtg agcacccgta tcaggtggtg 420 

ccgcccagac tgccgtgtgg agggtacctg cctgcccccc tgctgcgtgg tgagctgcac 480 

acccccatcc tgctgccagc tgcaccacgc cgaggcctcc tgctgccgcc catcctactg 54 0 

tggacagtcc tgttgccgcc cagtctgctg ctgctactgc tctgagccca cttgttgaaa 600 

acctccttct gctggggatc ctgataagat ggcaccttaa aactagccaa attagaatcc 660 

taacaatctt ctgaactcca gtacctataa ctgggcttgc aacctctcat cacacagcca 72 0 

cataaattcc ctaggaagta aattcattta caatggaaga ccaaaaattt ttcctagacc 780 

tggttgtcag ccaaagtcct acaatgtgaa aagagttaga tactatttta ctataaatat 840 

cacctgaaat atttcaacag ttattgggac ttaaatttaa taaaagtttt catctcttca 900 

atg 903 
<210> 328 
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163 



<211> 615 
<212> DNA 
<213> Homo sapiens 



<400> 328 
cgtgacctgc 
ctgcgacccc 
gtgcacggct 
tgtgcagtcc 
caggacctcc 
cggccagccg 
tcacgaaggg 
cagagcaata 
atgcattgaa 
agtctaactc 
aaaaaaaaaa 
<210> 329 

<211> 812 



gtgccccgct 
tgctccctgc 
gtggtgtgca 
ccctgctgcc 
tcctgctgag 
accccttgca 
cccctcagaa 
cacgtttcct 
gacacctttt 
ctatatcatg 
aaaaa 



gcacgcgccc 
aggaaggctg 
ggccctgctg 
ggcccccctg 
ccaccacctg 
gcaccacctg 
gatggccagg 
tgagaagccc 
cagaccaacc 
ttgtctgctt 



catctgcgag 
ctgccgcccc 
ctgggccacc 
cggccagccg 
ctgccagcct 
caggacctcc 
tccatcccgc 
atttctcatc 
acagatgaga 
tctaataaac 



<212> DNA 

<213> Homo sapiens 



ccctgccgcc 
atcacctgct 
acctgctgcc 
accccttgca 
gtgtctgtgc 
tcctgctgag 
tgcccctcag 
tcttcatact 
aatacttttc 
tcaatactcc 



gcccggtgtg 
gcccctcgtc 
agcctgtgtc 
gcaccacctg 
agtccccctg 
cagcccgtta 
ggcttcaccg 
agctcacact 
ctaggactcc 
taccataaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
615 



<400> 329 
aatagcccaa 
cttgtcctcc 
ctgctgccgc 
ctgcacgcgc 
gcaggaaggc 
caggccctgc 
ccggcctccc 
agaccccaat 
aaaagttaaa 
ggatacagcc 
acatcagcct 
tgcaaacaca 
caatgtgttg 
aaaaaaaaaa 
<210> 330 



cccacaccag 
ctgagctacg 
cccgtgacct 
cccatctgcg 
tgctgccgcc 
tgctgggcca 
tgcggccagc 
gcccccacag 
agaaaaatag 
tggtgtaagc 
tagtcattat 
ttctacaacc 
ctcactgttg 
aaaaaaaaaa 



cctcagacac 

ggggaggctg 

gccagaccac 
agccctgccg 
ccatcacctg 
ccacctgctg 
cgaccccttg 
agcaatacac 
attttttttc 
agacgtccat 
ggatagtaaa 
tgatggtttt 
tcaataaact 
aaaaaaaaaa 



caccatgacc 
ctgccagccc 
cgtgtgccgc 
ccgcccggtg 
ctgcccctcg 
ccagcctgtg 
cagcaccacc 
tgaagcctaa 
acaaggtgac 
taccaccctc 
tcgaccttta 
tactgctcaa 
aatttttcct 
aa 



<211> 601 
<212> DNA 
<213> Homo sapiens 



ggctcctgct 
tgctgctgcc 
cccgtgacct 
tgctgcgacc 
tcgtgcacgg 
tctgtgcagt 
tgcaggacct 
acatctatct 
aatagtgatt 
acccacattt 
agaattcctg 
actgtcacca 
ggcaaaaaaa 



gcggctccac 
gcgacccctg 
gcgtgccccg 
cctgctccct 
ctgtggtgtg 
ccccctgctg 
cctcctgctg 
ggtgttttta 
tttaccatct 
tcaggtgtct 

gggtggactt 

tcatcttttg 
aaaaaaaaaa 



60 
: 120 
1 180 
'».' 240 
\ 300 
360 
420 
480 
540 
600 
660 
720 
780 
812 



<400> 330 
tgcttggtga 
gcctccttct 
ctctgtctgc 
ttgctgccag 
ctgttgcagg 
ctgccactca 
ctaccagccc 
tgtgtgtcca 
tatgaatcct 



gtgtggctac 
gctcgtcatc 
tctgaataga 
accacctact 
cccccgtgct 
agctgcagtg 
tcctgctgcc 
gctgctgctg 
cttgctgttg 



ttccttgaac 
tgcgattctc 
gctgtggcca 
gcagaaccat 
gccagtccct 
tgtccacctg 
acccttctgc 
tccagtctgc 
aacttcattc 



cacatggaga 
atctgacacc 
aggcctctgc 
ctgctaccat 
gtgctgcccc 
ctccaggccc 
tgcacttcta 
taccagacca 
ctgaccacca 



cacattgtac 
atggtcagct 
cagacctgct 
cccagctgct 
agctgctgca 
agctgttata 
gctgctgcca 
cctgctgtgg 
gccctggttc 



aggaagaagt 
tctgttgtag 
gctgctgcag 
ctgtgtccag 
tttctagccg 
atccccagta 
ccctggctgc 
tccagtctcc 
aaccaccttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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164 



ttgtcagtgt accagtcatt ctcattcccc ttctccactg gacctggcct tgccctgatc 
c 

<210> 331 



600 
601 



<211> 1202 

<212> DNA 

<213> Homo sapiens 



<400> 331 
cttcactctc 
ttgtggctcc 
ccgccccagc 
tgtgtccagc 
tcgccccagc 
ccgccccagc 
ccagcccacc 
tgtgtccagg 
ccgtcccagc 
ccgcccatgc 
ccacaccact 
tgcctcctct 
cagatgtaga 
tcaataggat 
acattctggt 
aatgtgaatt 
ttttctttat 
gaccagtttt 
tctcctaggg 
acttaagaaa 
aa 

<210> 332 



ttgaaaaccc 
gtgtgctctg 
tgctgtcaga 
tgctgcaggc 
tgctgtgaga 
tgctgtatgt 
tgctgccgcc 
tgctgcaggc 
tgctgcatct 
tgctgccgcc 
tgctatcgcc 
tgctgctgag 
cccttctact 
ggaccttatg 
tcattttaaa 
aatttgtaat 
cactttaagg 
gtcactgatg 
tgaatttctt 
aaaaaaaaaa 



acccagatcc 
accagggctg 
ccacctgttg 
cccagtgctg 
cgacctgctg 
ccagctgctg 
ccagctgctg 
cccagtgctg 
ccagctgctg 
cctgctgctg 
caacctgtgt 
cccactgccc 
gtgctgacca 
cttccaaaga 
ctccctccct 
acactagcta 
tacagattct 
ttgcaccctc 
atgctttgtt 
aaaaaaaaaa 



tccccgttct 
cggccaagtc 
caggaccacc 
ccagtctgtg 
ccaccctagg 
caagccccag 
catctccagc 
ccagtctgtg 
ccgcccctct 
cctgcgtcca 
catctccacc 
tggctcacgt 
ttaggataca 
gcccaccacc 
tgctttcttt 
agaaattatt 
ccttctcagt 
agatccagcc 
gcatctctgc 
aaaaaaaaaa 



gacaccatgg 
ctctgtcagg 
tgctaccgcc 
tgctgccaac 
tgctgcatct 
tgctgccagt 
tgctgtcgcc 
tgctgccagc 
tgctgtgaat 
gtctgtggcc 
tgtccccgcc 
cccccttcac 
tgaagtgggg 
atttcactga 
ttcttctggt 
ccaatcttct 
gaggtagata 
acccaattgt 
tttctaataa 
aaaaaaaaaa 



tcagctcctg 
agacctgctg 
ccagctgttg 
ccacctgctg 
ccagctgctg 
ctgtgtgctg 
ccagctgctg 
caacctgctg 
ccagctgctg 
gagtctcctg 
ccttgtgctg 
cactggccca 
ttgatgtcat 
ctctgtgaga 
ggtggcacca 
gatttcctta 
ttatctgcag 
attctgtgtt 
acttttctgc 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600< 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1202 



<211> 1219 

<212> DNA 

<213> Homo sapiens 



<400> 332 
ttcactctcc 
tgtggctccg 
cgccccagct 
gtgtccagct 
cgccccagat 
aagccccagt 
atctccagct 
cagtctgtgt 
cgcccctctt 

ggcggagtct 

cgccccttgt 
tcactgcagg 
gggattgatg 
ctgactctgt 
cagtcatggc 
ctctaaattc 
acacattatc 
cttggaaagg 
gtctctgttt 
tgttcacaaa 
aaaaaaaaaa 



tagaaaccca 
tgtgctctca 
gctgtgagac 
gctgcaggcc 
gctgcatctc 
gctgccagtc 
gctgtcgccc 
gctgccagcc 
gctgtgaatc 
cctgccacac 
gctgtgcctc 
cccacagttg 
tcattcagca 
gagaacattc 
accaaatatg 
ctcatttttt 
tccatttcaa 
taggaggctg 
gttcaagttt 
atatattgta 
aaaaaaaaa 



actagatcct 
ccaaggctgt 
cacctgctgc 
ccagtgctgc 
cagctgctgt 
tatgtgctgc 
cagctgctgt 
cacctgctgc 
cagctgctgc 
cacttgctat 
ctcttgctgc 
tagaccattc 
gggtggactt 
tggttcattt 
aaataatttg 
aaatcgtttt 
acatattatt 
cccctcccat 
gccagaattt 
ttaaactttt 



tcaccctctg 
ggccaagacc 
aggaccacct 
cagtctgtgt 
cgccccagct 
cagcccactt 
gtgtccagct 
caccccagct 
cgcccctgct 
cgcccaacct 
tgagcccact 
ttctgtgctg 
catgtttcca 
taaactccct 
taatccacta 
gagcctacag 
tgtctgtcag 
gctctcctgc 
ttctatttta 
catttagaaa 



acaccatggt 
tctgccagga 
actgtcgccc 
gctgccagcc 
gctgtgtgtc 
gctgccgccc 
gctgcagacc 
gcagcatctc 
gctgcctgcg 
gtgtcatctc 
gccctggctc 
actattagga 
atgagcccat 
cccttgcttt 
gctaagaaat 
aatatccttc 
cctttcagtc 
tttctccctg 
ttggttcttt 
aaaaaaaaaa 



aaactcctgt 
gacctgctgc 
cagctgctgt 
cacctgctgc 
cagctgctgc 
cagatgctgc 
ccagtgctgc 
cagctgctgc 
tccagtctgt 
cagctgcccc 
atctctccct 
cacatggagt 
caccatccca 
ctttctcttc 
tattccaatc 
ccagtgaggt 
attcttttct 
ttctctcttt 
atctttattg 
aaaaaaaaaa 



60^ 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1219 
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rhhrectaaa atcctcaccc tctctgtttt ccaaaacctc cccccattca cattacatat 2460 
atctaacc?? lagtgacata tgacalagtg tccaacataa gaatttaggc cacctaggac 2520 

SSSSS SS SSSSS SSSSS 
SSSSS SjgjA jjtjggj. — ? « ?s „.. 

gagggagctt atagtgagag 9cattctccc taoccca ctacccccat ctgtccgccg 2.820 
SSSSS SSSSS SSSSS SSlcalc. ggtgcacttc ctggagcagc 3120 

SSS SSSSS SSSSS SSSSS SSSSS 

ill- sssss sssss sssss sss | 

ssss iiii iiii iiii Ei sssss ss 

SSSSS i S 1 SSSSS SSSSS SSSSS i«. 

gaaggaggag aaa 99 a 9 a 99 gccagaacc 999^99 CCC caccctg acccacatca 372^ 

gcccagaaac agagaggagg tcttcgaaaa cgcgcagyyi- » faaatoccca 378dSBr 

Igaatgaaag tccctgaaat agaaagagct ttgagtaaca tgggttggtt tgaatgccca 

llagalaggi atggacaagt tcccttcagt ccaacacttt t^tgattctt ggaccaccc 38 

5S sss s iii m ihss s 

•52 SS SSs •§§ if s iSi ss 

sssss sssa —sss sssss sssss iss^s «»• 

B 55B5S sssss ssjss s i i 

SSSSS SSSSS SSSSS i f ^KSS SS 

S35S sssss sssss s s SSSccc. = g»= S ca 

gatcttaagt tttggtgata gtgcatgaca tggattctag a 9*^tcatc ccaguggg g 

3S3 as iSls 3SS gil SS5 s 

agcaaggagg atatgagtta 9^*99** gagcaggtga agaagtgtgg 5220 

atggtgggac aaggggagaa accccaggca 99ttcaccac gage gg g aagtgacaa t 5280 

sssss sssss sssss sags sag-- 5 ~w »« 

SSSSS SSSS SSSSS SSSSS SSSSS jSjgS. s«o 

sssss sssss sssss sssss SSSSS SS S jm. 

latccgcgtt ctccagtcgc acatctcaga cacctcagtg 9tcgtcaaga tgga^ g ^ 

tegggacctg aacatgeact gtgtcatcac tgagatcaaa gctcag g | 576Q 

cacccgcagc egggecgagg ccgagtcctg gtaccgcagc aaggtg g g g ttc * ttt 582 o 
cctgcctgct agacatggca gttggaggga ^gcaaggtac ctattaaata 9|ccccc 

tctggggatt ctggtcccta cggatgggaa jataagggct 9«^ctctga 99^9999^ ^ 

gcagggcagg actgecatgt gt9gttgcac ^^agca ^gcac a I tctacttc 600 0 

SSSSS SSSSS SSSSS SI S S.g. r Sg c f ~ ;s . 

sssss sssss sss s s sigjai ? p gg, si = ss 

cctcagcaga gggatttggg tt»9» c «^ 222H2 !2a??ctcgc Igaatggagg 63 00 
SSSSS SSSSS SSSSS SSSSS SSSSS gfggaaaeag 6 3S0 
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cggcatcatc 
ggctgaagcc 
ggacgacctc 
gcaggctgag 
ggctgaggag 
agccgccctg 
catgagcgtg 
cgaggagagc 
cactggtggc 
caatgccctg 
gaccgcatcc 
tccagccacc 
agacagtgag 
tgaggcctga 
aaaaaaaaaa 
<210> 365 



gctgaggtca 
tggtaccaga 
cggaataccc 
atcgacaaca 
cgtggggagc 
cagcgggcca 
aagctggccc 
cggtfcggctg 
agtagcagtg 
agcttctcca 
gccagtcgca 
acccacaatc 
acagtctgga 
gtgatccacg 
aaa 



aggcacagta 
ccaagtttga 
ggaatgagat 
tcaagaacca 
tggcgctcaa 
agcaggatat 
tggacatcga 
gagatggagt 
gcggtggcat 
gcagtgcggg 
ggagtgcccg 
acaagaagat 
aagtgatgtc 
tgaaaaaaaa 



tgaggagatg 
gaccctccag 
ttcagagatg 
gcgtgccaag 
ggatgctcgt 
ggcacggcag 
gatcgccacc 
gggagccgtg 
tgggctgacc 
tcctgggctc 
cgactgagcc 
tcccacccct 
agaatagctt 
aaaaaaaaaa 



gccaaatgca 
gcccaggctg 
aaccgggcca 
ttggaggccg 
gccaagcagg 
ctgcgtgagt 
taccgcaagc 
aatatctctg 
ctcgggggaa 
ctgaaggctt 
gcctcccacc 
gcctcccatg 
ccaataaagc 
aaaaaaaaaa 



gccgggctga 
ggaagcatgg 
tccagaggct 
ccattgccga 
aggagctgga 
accaggaact 
tgctggaggg 
tgatgaattc 
ccatgggcag 
attccatccg 
actccactcc 
cctggtccca 
agcctcattc 
aaaaaaaaaa 



96< 
1021 
108< 
114< 
120< 
126< 
132< 
138( 
144( 
150( 
156( 
162< 
168( 
174< 

175: 



<211> 9721 



<212> DNA 



<213> Homo sapiens 



<400> 365 
aaactccctt 
gagaagagga 
gggacacagg 
aacatttaca 
ctcattagat 
agaaaactga 
actgacttca 
cgctgcgggg 
aaagctcgga 
gcgcacgggc 
gaacgccgtt 
ggcgcgaggg 
cccgcgggcc 
ccaccgcacc 
cagctgcatc 
ccgcggcgtc 

gggcttccgc 

acccagcccc 
cctggagatc 
cctcaacagc 
tgaacctgga 
tgaggatggc 
ccgaaacacg 
gtgggaaaaa 
tcctgcttct 
actagaattt 
ctctggtggc 
acttatagat 
tgggtgtaca 
cagagagtgg 
gggctagaag 
tcacctcaac 
ctggcccagg 
gtgtggcact 
tgtggtgcag 
atgagacccc 
ccccatgtct 
aggaccccag 
ggaaaactgg 
gtattgttgc 



cccctcgccc 
ggtatgagct 
atgaatgata 
cagttcttat 
cctggcaacg 
ggctcagaaa 
gaggccccac 
gaccggggct 
gtcccggcgg 
agcggggctg 
ttatgcaggc 
ccgagccctc 
gtcatataag 
atgtcctgcc 
tcggcctgcg 
tcctgctacc 
gccggctcct 
ccatgcatcg 
gaccccaacg 
aggttcgcgg 
agtcctggaa 
tgatatttta 
tggtaagaaa 
aatatcactg 
aataaagaga 
aatgttctgt 
catatgaagt 
aaattaggag 
cagaagagaa 
gtgccagaac 
gtccacagct 
aggcactgca 
agcaggccac 
gggcagaggc 
tgtgtgagct 
atagccaaga 
gcaagttcca 
ggcttaaagg 
aagccaaggg 
catgtgttcc 



aagcagagtg 
gagtcctgag 
aattaacaat 
taagtgatag 
gctctcagtg 
ggttattcag 
actctcgaca 
atgggactgg 

aggggcggac 

accaggcgct 
ggccgggcgg 
caagctcgta 
gcgcagcgcc 
tctactcccg 
ggccccggcc 
gcggcctcac 
gcggacgcag 
ccaccgtgtc 
cacagtgcgt 
ccttcatgga 
ggagatggaa 
tataatgtta 
atacatcttc 
gatgctaatg 
gcaagccccg 
aatttccctt 
tactctttta 
taaaatagta 
aggaggcaga 
agggttccaa 
tgggatccca 
cgctggagag 
ctggcctcca 
ttcccttctc 
ggccatgact 
ccacctgaaa 
acctgggaat 
gaaggagaca 
ccctagtttg 
tccagtttca 



gagaggggta 
atagaagaga 
agaataaata 
gtagggccct 
cttactatta 
ctgcccaagg 
cacgcacaca 
gaccgaagtg 
aggtggaggg 
tcccggcccc 
cctggctcgc 
aacgccttgg 
ggcttggggt 
cctcagcgcc 
cggccgctgc 
cggaggcttc 
cttcggctac 
ggtcaacgag 
gaagcaggag 
caaggtgggt 
attcccagtg 
ggcggtgccg 
tttaatgtct 
tttttagcat 
gcttcagaac 
tatgtatgcc 
accccatacg 
tatgggatgg 
gagagggtga 
gcccaggccc 
gctgccccac 
gtgacccagg 
tgctcagtta 
caccccagtg 
ctgaaaggga 
ttatgccatg 
tcccagctgc 
tctcagctcc 
tttgctgaca 
gtgaaatcca 



caggaaggag 
agagaaagtt 
gaacaaataa 
aaatgctttc 
ttatcgctat 
tcatgacatg 
ctcgccccac 
gaggccggat 
tttcactcgc 
ataaaactcg 
tttcattccc 
ccgcgagctc 
ccagccgccc 
ccctgcgggg 
tgcatcaccg 
ggcagccaca 
cgctccgggg 
agcctcctca 
gagaaggagc 
gtcctggatc 
gtctctctag 
gcccacaaca 
gtttcaatcc 
ctctctaata 
tgtctaggac 
tagtaatgat 
tgtaaatgtt 
caaaatcctg 
ggggctggcc 
tctccaactc 
tccctctaga 
agcggctcgt 
agttcctcta 
gcttcctcac 
aaatggggcc 
taatatcagc 
actcatggag 
agctcaaatg 
tcctttctgg 
ggaaaagcgt 



ggacgcttac 
tttgcccaaa 
tgaataaata 
tggtcattat 
tttgcctgtg 
aacctgcggg 
acacggtgcc 
ccaacaaggc 
actgaccgca 
ggcgggaggc 
ggctccgcga 
cctccccggg 
gcccctgccg 
tccgcgcctt 
ccgcccccta 
gcgtgtgcgg 
gcgtgtgcgg 
cgcccctcaa 
agatcaagtc 
acacctttcc 
aaatggggaa 
ttaaataaca 
ttaagattag 
cttgcttccc 
aactgtttct 
ggttttgcat 
aattgaatcc 
aagtcagcct 
caagggccca 
tccagcccct 
aggggctggt 
ggaatggaca 
ctgacgcatt 
cagggctgtg 
caaggcagcc 
agtccccact 
ttggatgaaa 
tacgagaagt 
ccagcacaat 
acaagcctca 



6C 
12 C 
18C 
24C 
30C 
36C 
42C 
48C 
54C 
60C 
66G 
72C 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
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cgggtcagga ccctgctctc ctggcgcagt tcccagctat ctcccctgct cctctgctgg 
tggtgggcta ataaagctga ctttctggtt gatgcaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaa 
<210> 363 

<211> 1512 

<212> DNA 

<213> Homo sapiens 



<400> 363 

ctcctctcca gcccttctcc tgtgtgcctg cctcctgccg ccgccaccat gaccacctcc 60 

atccgccagt tcacctcctc cagctccatc aagggctcct ccggcctggg gggcggctcg 120 

tcccgcacct cctgccggct gtctggcggc ctgggtgccg gctcctgcag gctgggatct 180 

gctggcggcc tgggcagcac cctcgggggt agcagctact ccagctgcta cagctttggc 240 

tctggtggtg gctatggcag cagctttggg ggtgttgatg ggctgctggc tggaggtgag 300 

aaggccacca tgcagaacct caatgaccgc ctggcctcct acctggacaa ggtgcgtgcc 360 

ctggaggagg ccaacactga gctggaggtg aagatccgtg actggtacca gaggcaggcc 420 

ccggggcccg cccgtgacta cagccagtac tacaggacaa ttgaggagct gcagaacaag 480^ 

atcctcacag ccaccgtgga caatgccaac atcctgctac agattgacaa tgcccgtctg 5401 

gctgctgatg acttccgcac caagtttgag acagagcagg ccctgcgcct gagtgtggag 600 

gccgacatca atggcctgcg cagggtgctg gatgagctga ccctggccag agccgacctg 660 

gagatgcaga ttgagaacct caaggaggag ctggcctacc tgaagaagaa ccacgaggag 72 0 

gagatgaacg ccctgcgagg ccaggtgggt ggtgagatca atgtggagat ggacgctgcc 780 

ccaggcgtgg acctgagccg catcctcaac gagatgcgtg accagtatga gaagatggca 840 

gagaagaacc gcaaggatgc cgaggattgg ttcttcagca agacagagga actgaaccgc 900 

gaggtggcca ccaacagtga gctggtgcag agtggcaaga gtgagatctc ggagctccgg 960 

cgcaccatgc aggccttgga gatagagctg cagtcccagc tcagcatgaa agcatccctg 1020 

gagggcaacc tggcggagac agagaaccgc tactgcgtgc agctgtccca gatccagggg 1080 

ctgattggca gcgtggagga gcagctggcc cagcttcgct gcgagatgga gcagcagaac 1140 

caggaataca aaatcctgct ggatgtgaag acgcggctgg agcaggagat tgccacctac 1200 

cgccgcctgc tggagggaga ggatgcccac ctgactcagt acaagaaaga accggtgacc 1260 

acccgtcagg tgcgtaccat tgtggaagag gtccaggatg gcaaggtcat ctcctcccgc 1320 

gagcaggtcc accagaccac ccgctgagga ctcagctacc ccggccggcc acccaggagg 1380 

cagggagcag ccgccccatc tgccccacag tctccggcct ctccagcctc agccccctgc 144,0 

ttcagtccct tccccatgct tccttgcctg atgacaataa agcttgttga ctcagctaaa 150.0 
aaaaaaaaaa aa 
<210> 364 

<211> 1753 

<212> DNA 

<213> Homo sapiens 



1512 



<400> 364 

cagccccgcc cctacctgtg gaagcccagc cgcccgctcc cgcggataaa aggtgcggag 60 

tgtccccgag gtcagcgagt gcgcgctcct cctcgcccgc cgctaggtcc atcccggccc 120 

agccaccatg tccatccact tcagctcccc ggtattcacc tcgcgctcag ccgccttctc 180 

gggccgcggc gcccaggtgc gcctgagctc cgctcgcccc ggcggccttg gcagcagcag 240 

cctctacggc ctcggcgcct cgcggccgcg cgtggccgtg cgctctgcct atgggggccc 300 

ggtgggcgcc ggcatccgcg aggtcaccat taaccagagc ctgctggccc cgctgcggct 360 

ggacgccgac ccctccctcc agcgggtgcg ccaggaggag agcgagcaga tcaagaccct 420 

caacaacaag tttgcctcct tcatcgacaa ggtgcggttt ctggagcagc agaacaagct 480 

gctggagacc aagtggacgc tgctgcagga gcagaagtcg gccaagagca gccgcctccc 540 

agacatcttt gaggcccaga ttgctggcct tcggggtcag cttgaggcac tgcaggtgga 600 

tgggggccgc ctggaggcgg agctgcggag catgcaggat gtggtggagg acttcaagaa 660 

taagtacgaa gatgaaatta accgccgcac agctgctgag aatgagtttg tggtgctgaa 720 

gaaggatgtg gatgctgcct acatgagcaa ggtggagctg gaggccaagg tggatgccct 7 80 

gaatgatgag atcaacttcc tcaggaccct caatgagacg gagttgacag agctgcagtc 84 0 

ccagatctcc gacacatctg tggtgctgtc catggacaac agtcgctccc tggacctgga 900 
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<400> 361 



cftol^ ottctettc. ctcagccaac tgctcgctcg ctcacctccc tcctctgcac 60 
™« tgcagccgcc agttcacctc ctccagctcc atgaagggct cctgcglcat 120 

cgggggcggc atcgggggcg gctccagccg catctcctcc gtcctfilcg gacSgtSctq 
SEES? agca = Ctacg 3939C99cct gtctgtctca fcctcl?gc? tSSSSS 

??? ct 99™ gcggctatgg cggtggcttc agcagcagca gcagcagc?? 

^ 93gg3a 9 Satatggtgg Cggccttggt gctggcttgg Itgltggctt 
tggtggtggc tttgctggtg gtgatgggct tctggtgggc agtgagaagg tgaccataca 
gaacctcaat gaccgcctgg cctcctacct ggaSJgtg cgtfctrtg? £fj 
cgccgacctg gaagtgaaga tccgtgactg gtaccagagi dglggcctl ^tlltcaa 
tSS^SHt CCCta *" ca agaccattga ggacctfag aalaffattf tcfcfgcSS 
agtggacaat gccaatgtcc ttctgcagat tgacaatgcc cgtctggccg cggatqact? 
ccgcaccaag tatgagacag agttgaacct gcgcatgtgt gtggalgccg SSScSSS 
cctgcgcagg gtgctggacg aactgaccct ggccagagct lalltggag? tsSSitta? 
gagcctgaag gaggagctgg cctacctgaa gaagaaclac iaggagSa tgaafSccf? 
gagaggccag gtgggtggag atgtcaatgt ggagatggac gctqcaccta aS^f 
gagccgcatt ctgaacgaga tgcgtgacla llatgag'Sg Itggcagaga SSSSSSZ 
ggatgccgag gaatggttct tcaccaagac agaggagcti aallgc£af g Sgccaccaa 
cagcgagctg gtgcagagcg gcaagagcga gatctcggag ctccgglglf «Ito«a^ 
cctggagatt gagctgcagt cccagctcag catgaalgcl tccc?gg?ga acaScSSa 
ggagaccaaa ggtcgctact gcatgcagct ggcccagttc caggagfga ttjgcag?? 3 
ggaggagcag ctggcccagc tccgctgcga gatggagcag calialcagg asSSSES 
Saa 9 o^[~ 9tgaagacgc ggctggagca ggagatcgcl acltaccgc? glctgcSS 
gggcgaggac goccacctct cctcctccca gttctcctct ggatcgcagt catc?aaSa 
tgtgacctcc tccagccgcc aaatccgcac caaggtcatg gatgtgcacq atcSSEf? 

SSSSS tCC " CgCaC -aSactgl SESSSS £SgS3£ 
«^ a ? 9 " ■ a ? ooce B t a t g9acacagat cccactggaa gatcccctct cctgcccaaf isso 
?a?ctgag a ? ^ g f CCCtg <="caccctc accccctcct fgcaatcaat acag'cttca? SJJ 
<210> 362 1634 

<211> 1688 

<212> DNA 

<213> Homo sapiens 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



<400> 362 



60 
120 
180 
240 
300 
360 
420 
480 
540 



acagcacgct ctcagccttc ctgagcacct ttccttcttt cagccaactg ctcactcact 
cacctccctc cttggcacca tgaccacctg cagccgccag ttlaccteet 
gaagggctcc tgcggcatcg gaggcggca? cgggg^gg? tccagccgca tct?ctcc«t 
cctggccgga gggtcctgcc gtgcccccag cacctacglg ggcg5cctat cS£e?^£ 
c^"" tCC S? 9 "*™' cctgcgggc? ggggggcgg ??tgcgg?g gc?tcagcaS 
SSSSS " t9g ^ 9 ^ SCttcggggg aggatatglt ggtggc??tg Sgc?g?c?t 
cggtggtggc ttgggtgctg gctttggtgg tggttttgct ggtaqtaata oal^^»^ 
gggcagtgag aaggtgacca tgcsoeacc* r2*+c=~-~~ S™-?.*? W^tctggt 
ggtgcgtgct ctggaggagg ccaaSgccga cctgga^g ^gatccg^ actoafacca 

sssss ssss ssss ttthii ssS m 

Ell? SSSSS SSSSS SSS5 sssss I Si ™ 

Si? 3 *T S "- ="S?S S3ESK S5~ S55K 

ggatgctgca cctggcgtgg acctgagccg catcctgaat gagatqlqfq accaa^S 
gcagatggca gagaaaaacc gcagagacgc tgagacctgg tSJSSS aaa^^f. 
gctgaacaaa gaagtggcct ccaacagcga actigtacS agcag?clca alalan?^ 
ggagctccgg agggtgctcc agggcctgga gatfgagc^g Sgtlccfgc ?cI2ftgaa 
agcatccctg gagaacagcc tggaggagac caaaggccgl taltgcatac aSSSSS- 
gatccaggga ctgattggca gtgtggagga gcagl!gg?c cagc?ac5ct 1 
gcagcagagc caggagtacc agatcttgct ggaSgtgag aScggSgg SSfSSf 

tfSccaa^e * g ~ g ~ tgc tggagggcga ggatgclcal cttSSSS 2££5£ 
tggccaatcc tattcttccc gcgaggtctt cacctcctcc tca^-r^^n^^ rZZZZZZ Z 

iSSST^ atCctcaa ^ agcagagctc atccagcttc Sclgggcc SgcScS 
gaactgagct gcctctacca cagcctcctg cccaccagct glcctSSct Sgaaggcc 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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accaqcggct catggacatc aagtcgcggc tggagcagga gattgccacc taccgcagcc 
?qctlgSgg acaggaagat cactacaaca atttgtctgc ctccaaggtc ctctgaggca 
Iclllcllll gggcltclgc tgtcctttgg agggtgtctt ctgggtagag ggatgggaag 
laaggacc? ?Iacccccgg clcttctcct gacctgccaa taaaaattta tggtccaagg 
gaaaaaaaaa aaaaaaaaaa aaaaaaa 
<210> 360 



1200 
1260 
1320 
1380 
1407 



<211> 2352 



<212> DNA 

<213> Homo sapiens 



aaaaaaaaaa aa 
<210> 361 

<211> 1634 

<212> DNA 



180 
240 
300 

420^ 

480 
540 
600 
660 
720, 



ccc?acact? gggagccggt agcactccta tcactgcttc tcaacccgtg agctaccagc 
?g?g?catgl gc?gcaga1a gltctcctcg tcctacttga ccagcggcgg S^tggcggg 
ScScctgg Icagcgiggg cagcataagg tcttcctaca gccgcttcag ^tcctcaggg 
oo^lataaaa aaQqaiaccg attcagctct tctagtggct atggtggggg aagctctcgt 
Itntnlaaal llllllltll tggcaitttt ggctacagct acggcggagg atctgggggt 
£?t?tS?g agcg^tggc Stgggggtg 9ttccagagg "ttggtggt 

gcttctglag gaggctatag tagttctggg ggttttggag gtggctttgg tggtggttct 
aaaaatalct ttggtggtgg ctatgggagt gggtttgggg ggttaggggg ctttggaggt 
lllllllltt gaStStii tggtlllctg actgctaatg agaagagcac catgcaggaa 
c?cSt???? gg??ggccl? tllcttggat aaggtgcagg ctctagagga ggccaacaac 
ga?c?ggJga SLgtcca ggattggtac gacaagaagg gacctgctgc ^atccagaag 
LctacEccc cttattataa cactattgat gatctcaagg accagattgt Macctgaca 
g?gggcaaca acaaaactct cctggacatt gacaacactc gcatgacact Sfatgacttc 780 
aggliaaagt ttgagatgga gcaaaacctg cggcaaggag tggatgctga ^atcaatggc 840 ? 
Sacaqq tgctggacaa tctgaccatg gagaagtctg acctggagat gcagtatgag 900, 
ac?c?gca?g 2g3ctgat ggccctcaag aagaatcata aggaggagat gagtcagctg 
aSSaqa acagtggaga tgtcaatgtg gagataaacg ttgctcctgg caaagatctc 
accaaSc?c tcaStSclt glgtcaggag tatgagcagc tcattgctaa gaacagaaag 
gaca?cgSa atcla?atga laltcalata acccagatcg agcatgaggt atccagtagt 
gg?cag£gg tgcagtcclg Igccaaggag gtgacccagc tccggcacgg tgtccaggag 
ttQaaalttg agctgcagtc tcagctcagc aagaaagcag ctctggagaa gagcttggaa 
ga^Saagl accgctactg tgglcagctg cagatgatcc aggagcagat Jgtaacttg 
aaoacccaaa tcactgacgt ccggcaagag atcgagtgcc agaatcagga atacagccct 
X?ctcaS?a ttaagSgcg gctigagaag gaaatcgaga cctaccacaa cctccttgag 
gglggcclgg aigSS?gI ltcc?ccggl gctggaaaaa ttggccttgg aggtcgagga 
Jaaagtggag gcagttatgg aagaggatcc aggggaggaa gtggaggcag ctatggtgga 
llllllllll ISItggct a tggtggagga agtgggtcca ggggaggaag tggaggcagc 
?acStgS2 SStSttc tHallaigt agtggaggtg gctatggtgg aggaagtgga 
aatScSta gcggalgaag tggaggtggt catagtggag gaagtggggg caactatgga 
SaaSgaSgtg gc?ctSagl aSalltggg ggtggctatg gtggaggaag tgggtccagg 
aaSSaltg gaggcalcca tggtggagga agtggttttg gaggtgaaag tggaggcagc 
Scgaglc? ?5aIgLgc ggtSUagt ggtggcggct acggaggagg aagcggaaaa 
tSSSStt Iclagtcttc ttcctcaaaa tctggtgacc aagatgagac aaaaggcttc 
ctttcgcgat actafagcct ctgtaaactt ctcctgccca gcccccagct fagcatcccc 
agatglgcag actccclatg aagcctgctt attggatcct ^catcagga atagctggca 
alaaacccct gtgcccgttt gggactgaac tgacttggca tatgtgctct ggctttccat 
tggc?tccaa cc??gc?tgg cStggatgc cttcagggtt cccgagacag acctctttcc 
tS?ctctggc ctgglagcEc tcacaccctg cacgatcttg actataataa agcttcccta 
SgcaaaSL aalalaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



960/ 

102 o r ; 

1080;* 

ii4b; : 

1200^. 
1260; 
1320- 

13 80; 

1440 
1500 
1560 
1620 
168^ 
174^ 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2352 



<213> Homo sapiens 
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<212> DNA 

<213> Homo sapiens 



<400> 358 

ggtacctcct 

catgaccacc 

gggttccctc 

aagccgaagt 

tgggggtggc 

ctttggaggg 

cctctctggc 

ggacaaggta 

gtaccagaag 

tgaagagctc 

gatcgacaat 

cctgcgccag 

cctggccagg 

gaagaagaac 

tgtggagatg 

gcagtacgag 

gactgaggag 

ggagatcaca 

cagcatgaaa 

gctgcagcag 

cgagatggag 

gcaggagatc 

tggcatcagg 

agaagagtca 

tgcaggagaa 

gagggcctgc 

tcttttattt 
ggagcaagga 
aataaagaat 
<210> 359 



gccagcatct 
acatttctgc 
ctggctgggg 
atctcagctt 
atgagggtct 
ggcgttggtg 
aatgagaaaa 
cgtgccctgg 
cagaccccaa 
cgggacaaga 
gccaggctgg 
ggcgttgagg 
actgacctgg 
cacgaagagg 
gacgcagcac 
gccatggcgg 
ctgaacaaag 
gacctgagac 
gctgggctgg 
atccaggggc 
gctcagaacc 
gctacttacc 
gaagcctctt 
gtggatggac 
acgtcccttg 
acatgcaaac 
cagggctttg 
gatgcagctg 
tacttttcct 



cttgggtttg 
aaacttcttc 
gaggtggctt 
cttctgctag 
gtggctttgg 

ggggttttgg 

ttaccatgca 
aggaggccaa 
ccagcccaga 
tcatggccac 
ctgcggacga 
ctgacatcaa 
agatgcagat 
agatgaagga 
cgggtgtgga 
agaagaaccg 
aggtggcctc 
gcacgatgca 
agaactcact 
tcattggtgg 
aggagtacaa 
gcagcctgct 
caggaggtgg 
aggtggtttc 
ccactcccca 
tccagtccct 
catgcgctct 
tattgtgtaa 
tttgcaaat 



ctgagaactc 
ctccaccttt 
tggtgggggg 
gtttgtctct 
tggaggggct 
tggtggcttt 
gaacctcaat 
tgctgacctg 
atgcgactac 
caccatcgac 
cttcaggctc 
cggcttgcgc 
cgagggcctg 
gttcagcagc 
cctgacccgt 

ccgggatgtc 

caacacagaa 
ggagctggag 
ggccgagaca 
cctggaggcc 
gatgctgctt 
cgagggccag 
tggtagcagc 
ttcccacaag 
ctctcatcag 
gccttcagag 
attccccctc 
caagctcatt 



acgggctcca 

gggggtggct 

agtctctctg 
tcagggtcag 
ggtagtgttt 

ggtggtggcg 

gaccgcctgg 
gaggtgaaga 
agccaatact 
aactcccggg 
aagtatgaga 
cgagtcctgg 
aatgaggagc 
cagctggccg 
gtgctggcag 
gaggcctggt 
atgatccaga 
atcgagctgc 
gagtgccgct 
cagctgagtg 
gacataaaga 
gatgccaaga 
agcaatttcc 
agagaaatct 
gccaagtgga 
agctgaaaag 
tgcctctccc 
tgtacagtgt 



gctacctggc 
caacccgagg 
ggggaggtgg 
gaggaggata 
tcggtggagg 
atggtggtct 
cctcctacct 
tccatgactg 
tcaagaccat 
tcatcctgga 
atgagctggc 
atgagctgac 
tagcctacct 
gccaggtcaa 
agatgaggga 
tcttcagcaa 
ccagcaagac 
agtcccagct 
atgccacgca 
agctccgatg 
cacggctgga 
tggctggcat 
acatcaatgt 
aagtgtctat 
ggactggcca 
ggtccctcgg 
caccttcttt 
ctgttcatgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1709 



<211> 1407 

<212> DNA 

<213> Homo sapiens 



<400> 359 

cgcgaatcgc 

cctacagcta 

tgcgttttgg 

gcggcgtatc 

gcggctacgg 

ccatgcagaa 

cggccaacgg 

cctcccgcga 

gtgccaccat 

atgacttccg 

tcaacggcct 

agatcgaagg 

gtacgctgag 

ccgatctcgc 

accggaagga 

ctggccacac 

ttcagggtct 

cactggcaga 

gcggtattga 



agcttctgag 
tcgccagtcg 
gccgggggtc 
cgtgtcctcc 
cggcgtcctg 
cctcaacgac 
cgagctagag 
ctacagccac 
tgagaactcc 
aaccaagttt 
gcgcagggtg 
cctgaaggaa 
gggccaagtg 
caagatcctg 
tgctgaagcc 
ggagcagctc 
tgagattgag 
aacggaggcg 
agcccagctg 



accagggttg 

tcggccacgt 

gcctttcgcg 

gcccgctttg 

accgcgtccg 

cgcctggcct 

gtgaagatcc 

tactacacga 

aggattgtcc 

gagacggaac 

ctggatgagc 

gagctggcct 

ggaggccagg 

agtgacatgc 

tggttcacca 

cagatgagca 

ctgcagtcac 

cgctttggag 

ggcgatgtgc 



cgtccttcgg 

cgcccagcat 

tgtcctcgtc 

acgggctgct 

cctacctgga 

gcgactggta 

ccatccagga 

tgcagatcga 

aggctctgcg 

tgaccctggc 

acctgaagaa 

tcagtgtgga 

gaagccaata 

gccggactga 

ggtccgaggt 

agctgagcat 

cccagctggc 

gagctgatag 



yCtuCycctc 

aggcctgggc 

tcacgggggc 

ctcctcgggg 

ggcgggcaac 

caaggtgcgc 

ccagaagcag 

cctgcgggac 

caatgcccgt 

catgagcgtg 

caggaccgac 

gaaccatgag 

ggtggattcc 

tgaggtcatg 

agaattgaac 

tactgacctg 

gaaagctgcc 

gcatatccag 

tgagcggcag 



gccatgactt 

ggcggctccg 

tccggcggcc 

gcctacggcg 

gagaagctaa 

gccctggagg 

gggcctgggc 

aagattcttg 

ctggctgcag 

gaggccgaca 

ctggagatgc 

gaggaaatca 

gctccgggca 

gccgagcaga 

cgggaggtcg 

cggcgcaccc 

ttggaagaca 

gcgctgatca 

aatcaggagt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



BHC03 1 051 



- 178 



agtctcgccg 

agctgcttgc 

gggggctggt 

aacgaccgcc 

ctggagagcc 

cagtcctact 

aatgccaggc 

aagtatgaga 

aggatcctgg 

aaggaggagc 

caacttgggg 

atcctggagg 

gaggcctggt 

cagctgcagt 

attgagctgc 

gaggcccgct 

cagctgtctg 

gacgtcaagg 

gactgcaagc 

tctgtccccc 

actcagatcc 

cacgtgcagt 

<210> 357 



gtgctgcagg 

ctggctccta 

tctgcgaggg 

tggccaacta 

gcatccagga 

tcaagaccat 

tggtcctgca 

cagagctgtc 

atgagctgac 

tgatgtgcct 

accgactgaa 

atatgagatg 

tcaacaccca 

gctgccagac 

aggctcagca 

acagctccca 

agatccgctg 

cccggctgga 

ttcctcccca 

cggtgccctg 

gcaccatcac 

cccgcccgct 



gtacatctct 

cctgtcttct 

ctccttcaac 

cctggagaag 

gtggtacgag 

cgaagatttc 

gattgataat 

tctgcggcag 

cctgtgcaag 

caagaagaat 

tgtggaggtg 

ccagtacgag 

gactgaggag 

ggagatcatc 

cagcatgcgg 

gctggcccag 

cgacctggag 

gggcgagatc 

accttgtgcc 

tgtcccctct 

cgaggagatc 

gtga 



tctgctaggt 

gagtgccaca 

ggcagcgaga 

gtgcgtcagc 

tttcagatcc 

cagcagaaga 

gccaagctgg 

ctagtggagg 

gctgacctgg 

cacgaggagg 

gacgctgctc 

gccctggtgg 

ctgaaccagc 

gagctgagac 

aattccttgg 

atgcagtgcc 

cggcagaacc 

gctacctacc 

acggcatgca 

gtgccctgca 

agagatggga 



cgggcctctc 
cctctggctt 
aggagactat 
tggagcggga 
catacatctg 
tcctgctgac 
ctgctgacga 
ccgacatcaa 
aggctcaggt 
aagtcagtgt 
ccccagtgga 
agaataaccg 
aggtggtgtc 
gtacggtcaa 
aatccaccct 
.tgatcagcaa 
aggagtacca 
gccacctgct 
agcctgttat 
ccccggctcc 
aagtcatctc 



tggccttggg 
tgtggggagc 
gcagttcctg 
gaacgcggag 
cccagactac 
taagtctgag 
cttccggacc 
cggcctgcgt 
ggagtccctg 
actccgttgc 
tctcaacaag 
cagagatgtg 
cagctcggag 
cgcgctagag 
ggccgaaacc 
cgtggaggcc 
ggtgttactg 
ggagggagag 
tagagttcct 
ccaggttggc 
ctccagggag 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1404* 



<211> 1693 

<212> DNA 

<213> Homo sapiens 



<400> 357 

cacagtcctc 

accatgagcc 

tcttgccagc 

ggatcagctg 

ggctttggag 

ggctttggtg 

ctcactggca 

gagaaggtgc 

cacctgaagc 

gaagagctcc 

attgacaatg 

ctgcgccaga 

ctgtctaaga 

aagaagaacc 

gtggagatgg 

cagtacgagg 

agtgcagagc 

gagatcacgg 

agcatgaaag 

ctgcagcaga 

gagatggagt 

caggagatcg 

ccgtagcacc 

tggtcgccgc 

cttcagcacc 

ggacccaagg 

gggccctctg 

tggtttctca 

gcaaacataa 

<210> 358 



ggcccaggcc 
tccgcctgca 
tgggaggagg 
ggggctatgg 
gtggctatgg 

ggggttttgc 

atgagaagat 

gcgccctgga 

agagcccagc 

gggacaagat 

ccaggctggc 

gcgtggaggc 

ctgacctgga 

atgaagagga 

atgccacccc 

ccatggcaga 

tgaacaagga 

agctcaggcg 

cggggctgga 

tccagggact 

gccagaacca 

ccacctaccg 

tctgttacca 

acttctgatg 

cagaggagga 

ctcagtccct 

tgctcttctc 

acttctctac 

aaa 



aagcaagctt 
gagctcctct 
ccgtggtgtc 
aggcggcgtg 
aggtggcctt 
tggtggcttt 
caccatgcag 
ggaggccaac 
tagccctgag 
cctgaccgcc 
tgtggacgac 
cgacatcaac 
gatgcagatc 
gatgaaggaa 
aggcattgac 
gaggaaccgc 
ggtgtctacc 

cacgctccaa 
gaacacggtg 
catcagcagc 
agagtacaag 
cagcctgctc 
cgacttctag 
tccgtaggcc 
gagagccggc 
ctgctctcag 
ttccggtcgg 
cccaaagaaa 



ctatctgcac 

gccagctatg 

tctacctgtt 

agctgtggtt 

ggaggtggct 

gttgactttg 

aacctcaacg 

gctgacctgg 

cgggactaca 

accattgaaa 

ttcaggctca 

ggcctgcgcc 

gagagcctga 

tttagcaacc 

ctgacccgcg 

cgggatgctg 

aacactgcca 

ggcctggaga 

gcagagacgg 

atcgaggccc 

atgctgctgg 

gagggccagg 

tgcctctgtt 

ttaaatctgc 

agttccctgc 

gaccccctgt 

atctctctcc 

agattattca 



ctgctctcaa 

gaggtggttt 

caactcggtt 

ttggtggagg 

atggaggtgg 

gtgcttgtga 

accgcctggc 

aggtgaagat 

gcccctacta 

acaaccgggt 

agtatgagaa 

gggtgctgga 

atgaagagct 

aggtggtcgg 

tgctggcaga 

aggaatggtt 

tgattcagac 

ttgagctgca 

agtgccgcta 

agctgagcga 

acatcaagac 

acgccaagaa 

accaccacct 

ctggcgtccc 

aggagagagg 

cctgactctc 

tctctgacct 

ataaagtttc 



tcctgctctc 

cgggggtggc 

tgtgtctggg 

ggctggtagt 

ccttggaggt 

tggcggcctc 

ttcctacctg 

ccgtgactgg 

caagaccatt 

catcctggag 

tgagctggcc 

tgagctcact 

agcctacatg 

ccaggtcaac 

gatgagggag 

ccacgccaag 

cagcaagaca 

gtcccagctg 

tgccctgcag 

gctccgcagt 

acgtctggag 

gcgtcagccc 

ctaatgcctc 

ctccctctgt 

aggggctgct 

tcctgatggt 

ggatacgctt 

ctgcctttct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
66<^ 
72^ 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1693 



<211> 1709 



BHC03 1 051 



tgtgccctcc 
ctcaccctgc 
tggggctggg 
aaggaagacc 
acttaagccc 
gtgatgtttc 
tagagctgta 
<210> 355 

<211> 2051 



ccatgcgtga 
ccccagggcc 
cagcctggta 
ctgagcagga 
tcatcacttc 

tggttgtctg 

ctcagtggcc 



cccgcaccgt 
gctactgaag 
ttcagtggcc 
ccgtggatca 
aaaactgcct 
tgctgcctca 
ttgcaaac 



- 177- 

ctgtgtgcca 
tccctttgtg 
accagaagag 
cctgcaacaa 
cttttttcca 
aagagcgtgt 



cgcactgttg 
ccagtggatc 
cagggccagc 
gctctgatac 

tgggtgaact 

gttcttagtt 



gcatgccttg 
ctggagggcc 
cccggtcagc 
tccaggggat 
gttctctttg 
aactggcaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1708 



<212> DNA 

<213> Homo sapiens 



<400> 355 
tgggctcaaa 

ggggtggtgg 

ttttcacagg 
ttaatgagga 
tctggggtaa 
catctgagtt 
tctcaagagc 
cagctcttgc 
gggtctgggc 
tggaggcttc 
tggcaatgag 
ggtgcgtcag 
gcagcaggtc 
ccagaagaag 
tgccaaattg 
gctggtggag 
gtctgacctg 
ccatgaggag 
ggacgctgcc 
aaccctggtg 
gctgaaccag 
cgagctgaga 
agatgctttg 
gatgcagtgc 
gcggcagaac 
caacacgtac 
acctgactac 
tagtgcagcc 
gttctgagag 
tctacccaaa 
ggtctggctg 
aaggccggct 
gttctcaggc 
aatgcagtag 
cctaacttgc 
<210> 356 



tgacaggtcc 
agatgagtct 
aaactaaata 
gaaaacactt 
ggagctgctt 
gcaggggcca 
ccaggagggg 
aagcttccaa 
agaagcagct 
gctaccagct 
aaggagacca 
ctggagcagg 
ccctacatgt 
actctatgca 
gctgcagatg 
tcagacatca 
gaggcccagg 
gaagtgaact 
ccacctgttg 
gagaataacc 
caggtggtgt 
cgcacggtca 
gaatccaccc 
atgatcacca 
caggagtacc 

cggggcctgc 

tcaccctcca 
cgcacaaact 
cgggtgaccc 



aaaaggcttt 
gcccccaaag 
tgcctcctgg 
agggcttttg 



ctttgaagaa 
tgacatagta 
acaggttaat 
tgtaaaggcc 
ctctgctttg 
tggcttccaa 
ccagtggggg 
gtctctcccc 
acagggccac 
acagtggggg 
tgcaatccct 
agaacgccag 
gccctgacta 
gcaaggctga 
acttcaggac 
acggcctgcg 
tggagtccct 
cactgcgctg 
acctgaaccg 
gccgggatgc 
ccagctcaga 
acgccctgga 
tggcagagac 
acgtggaggc 
aggtgctgct 
tggagagtga 
agtcatgcct 
gcagcccccg 
agatggccat 



g^-sgcccaat 
ctgcaaatac 
gtctcaactc 
gtcaggtttt 
ttggcagaac 



ctgacaggga 
atggggctcc 
gaggggtttt 
aaccaatgcc 

gggtgtctgg 

atgcctcaag 
ctccactcgt 
tgtggccaga 
cagctgcctc 

tgggggctgg 
gaacgaccgc 
cctggagagc 
ccagtcctac 
gaatgccagg 
caagtatgag 
caggatcctg 
gaaggaggag 
ccaacttggt 
agttctggag 
tgaagactgg 
gcagttgcag 
gattgagctg 
ggaggcccgc 
ccagctggcc 
ggacgtccgg 
ggacagcaag 
tccctgtctt 
ccccatttgt 
ggctattgtc 
cccctccctt 
atgccctaaa 
ctcatcattt 
ccttctaggt 
aataaagtgc 



cttggtcttc 
tattttatga 
aatagaaagt 
cactggggca 
cctcagagac 
gccggcttct 
gtgtccgcaa 
agtttctctg 
cctgctctct 
tttggggagg 
ctggccggct 
cgcatccgtg 
ttccggacca 
ctggtggtgg 
acggaggtgt 
gatgacctga 
ctgctctgcc 
gaccgcctca 
gagatgaggt 
ttggacaccc 
tcctgccagg 
caggctcagc 
tatagctccc 
gagatccggg 
gcccggctgg 
ctcccctgta 
cctgcggcct 
gtgccctgcc 
tccagggctt 
cgcgcagagc 
gtttctcaga 
caatgggtgc 
gctgttccgg 
atttgctcag 



ccagtgatgt 
gactggtaaa 
ggtgttaact 
gagatataaa 
ctatcaattg 
cttctgggtc 
tgtactccag 
cctgctcagt 
gcctccctgc 
gcatcctcac 
acctggagaa 
agtggtgtga 
tcgaggagct 
agattgacaa 
ccctgcggca 
ccctgtgcaa 
tgaagaagaa 
atgttgaggt 
gccagtatga 
agagtgagga 
cagagatcat 
acagcatgag 
agctggccca 
ctgacctgga 
agtgtgagat 
acccatgtgc 
cctgcggtcc 
cagggggtcg 
gaacttggcc 
ccaggcccag 
gcctgtcaca 
cagggtctct 
tggattctga 
gcccctgatg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2051 



<211> 1404 

<212> DNA 

<213> Homo sapiens 



<400> 356 

IcS™^ « 9a ^ 9 ? aC ccctaccttc tccactgggt ctatcaaggg cctctgtggc 
acagcaggcg gcatctctcg ggtgtcctcc atccgttctg tgggctclti caggg?cc?c 



60 
120 



BHC03 1 051 



-176 



gagctccaac 
gacaatgcca 
cgccagctgg 
gccaaggccg 
agcaaccacg 
gagctggaca 
tatgaggcca 
gaaggcatca 
atcctggagc 
ttgaaggact 
gcccagatgc 
ctggagcggc 
gagattgcca 
tgctccacgt 
tgcaccacct 
gtggacccac 
ttagaaaatc 
cagggacagt 
gctctatgcc 
gagaggcata 
ttgcacttca 
aagaacagac 
gcctggatta 
ttctaattcc 
tttccttgta 
caggcttgcc 
atgtgaaagg 
tccagacttg 
caatgacttc 
ggtagagatt 
ttaccatgat 
tgctcttcct 
atgttccttc 
<210> 354 



agaagatcct 
agctggctgc 
tggaggcaga 
acctggaggc 
agcaggaagt 
ttgagcccac 
tgttggagac 
gcctgcagga 
tgagatgcac 
gtctgcagaa 
agagcctcat 
aaaaccagga 
cgtaccggaa 
ctccctcctg 
gtgggcccac 
aggggctggc 
tggcttctaa 
ggaatcttga 
ctcagtggcfc 
aggttgaaaa 
agaacaccaa 
ttcactgtct 
tggccctagg 
cttctcctat 
tcaaatcctg 
ttttctgtgg 
aactttcaga 
tctggctata 
atttttttct 
tgctaagtat 
tcagccacac 
tctctctggg 
ctctgag 



gtgcagcaag 
cgatgacttt 
caagtgtggg 
ccagcaggag 
aaagattctg 
cattgacctg 
caaccgccag 
catgtcctgc 
ggtgaatgcc 
ctccctgtgt 
cagcaacgtg 
gtaccaggtg 
ccttctggaa 
cgtgactgcc 
ctgtggagcc 
taaggcgagg 
ctttctgtat 
tggaaatcct 
ggattgcagc 
gcattttgga 
ctttaacttg 
gagatttccc 
gagagggaaa 
cctcctgcct 
agttttctct 
caacacagcc 
ttaagaaatt 
acccctgtcc 
agtactttct 
gatgtgcttc 
tggcctttct 
gtgctcttcc 



gccgagaatg 
aggatcaagc 
acacagaagc 
tccctgaagg 
aggagtcagc 
aacagggtgc 
gatgtggaac 
tccgaggagc 
ctggaggtgg 
gaagccgagg 
gaggagcagc 
ctgctggacg 
agcgaggact 
ccctgtgctc 
agcaccaccg 
gatacccaaa 
gtataggcct 
tcctttcctg 
cacccatggt 
gtaggaacac 
aaatttttat 
acatttcttg 
tagaatcaga 
ccctatcttg 
acttcagggt 
tcttacatat 
tctctctttt 
ctgtctaatt 
cctgaagtat 
cacctcttct 
gttccttgga 
ccctccttat 



ccaggctgat 
tggagagtga 
tcctggatga 
aggagcagct 
tgggggagaa 
tgggggagat 
agtggttcca 
tgcagtgctg 
agcgccaagc 
accgcttcgg 
tgtctgagat 
tgaagacccg 
gcaaactccc 
ctcgcccaag 
gaagccgatt 
gagagatgct 
gtccaaaggc 
ctcttggttt 
ggctcagaat 
ggcatgattt 
agtttatgac 
ggttgtttct 
attgtacctg 
gaaactgatt 
ttctagcttt 
ggcttccata 
ttctctataa 
ctcccagtct 
tgaaacctat 
cttcatactc 
catgcctgga 
aaatgttcag 



tgtacaaatt 
gcgctccctg 
tgcgaccctg 
ctccctcaag 
gctccggatt 
gcgggctcag 
agcccagtct 
ccagtcggag 
ccagcacacc 
cacggagctg 
ccgggccgac 
gctggagaat 
ctgcaatccg 
ctgtggcccc 
ctgaattcct 
tgttatacct 
tatgagatac 
tcccaggtga 
atctggattg 
. aaaattgcac 
aaaggaagtg 
aacattggca 
accctgtctt 
aagtggaaac 
gacaaaggac 
gcttcttgct 
attctgactc 
aaaatctaaa 
tgattcaatg 
tctaccttcc 
gacctgtccc 
gtcagcggaa 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220^ 

228C^ 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2837 



<211> 1708 



<212> DNA 

<213> Homo sapiens 



ggcggcatgg ggtctgtaga catccaggta gctgtggctg aggagaaagg gcctctccaa 6% 
catScatcc tcctgctgtg tcaccaacaa cttgcaagcc tctctcaaga ^tgcccccg 12 0 
gcctgcctcg gtctgttcca gcggcgtgaa ctgccggcct gagctgtgcc tgggctatgt 
ctgccagccc atggcatgcc tgccttcggt ctgcctgccc accaccttcc ggccagccag 
ctqcctctcc aaaacctatc tatccagttc ctgccaggca gccagtggca tctccggctc 
catgggcccc ggcagctggt acagcgaagg ggccttcaat ggcaatgaga aggaaaccat 
gcaittcctt aacgaccgcc tggccagcta cctgacgagg gtgcggcagc tggagcagga 
laatgcggag ctggagagca ggatccaaga ggcctctcac tcccaggtgc Jcaccatgac 
tcctgactac cagtctcatt tcaggaccat tgaggagctc cagcagaaga ttctgtgtac 
caaggcagag aatgccagga tggttgtgaa cattgataat gccaaactgg ctgccgatga 
cttcagglcl aaglacgagg cagagctggc catgcggcag ctggtggagg ccgacatcaa 
tggcctpgc agiatcItH atiatctcac tctgtgcaag gctgacctgg aggcccaggt 
tgagtclclg alggaggajc tgatgtgcct caaaaagaac catgaggagg aagtcggttc 
ccttcgatgc cagcttgggg accgccttaa catcgaggtg gacgctgcac ccccggtgga 
cctgaccagg gtgctggagg agatgcggtg tcagtacgag gccatggtgg aggccaaccg 
caSgacg?? ^algallggE tlaatatgca gatggaggag cttaaccaac aggtggccac 
aalc?c4ai LPttcSa actaccagtc agacatcatt gacctgagac gcacggtcaa 1020 
cafgctglag atcgagctic aggcccagca cagcctgagg gactccctgg aaaacacgct 1080 
gaclgaglg? gaglclcglt acagctccca gctggcccag atgcagtgca tgatcaccaa 1140 
cgt?galg?c cagctggltg agatccgggc tgacctggag cggcagaacc aggagtacca 
ggtgctgctg gacgtccggg cccggctgga gggcgagatc aacacgtacc ggagcctgct 
Iga|ag?gal iac?gcallc tgccctgtaa cccatgctcc actccttcct gcaccacctg 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1140 
1200 
1260 
1320 
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<213> Homo sapiens 



<400> 352 



m in m in m sso 

sssss sssa sgfis 51 is? =ssf 
sSS SSi sggg S5 I5S 

•sss ssss =1 |S 15= ss 

atgcagagct g^agaccacl !ca'cga?ec? SKS?" 1 

ccgactacca gtcctacttc cgtacaatcg ILIgctccf ScShr ^? t9t9C ° 

ss-a ss«s S 1F S " 
sssss sssss Is 2 s Sir- sssss issis 

Se SB III pi SEE Ess 

SSI sis ill H i iSI SB 

sssss Es£s ssls I is sir Is 
suss sssassS ssiis ill* ~ss sssss 
sssss sssss is i i Sis ~« 

SSSf"" — "«~ aaaa C9 «aS S^SSS S?SS gSSS 



<211> 2837 

<212> DNA 

<213> Homo sapiens 



<400> 353 



cl?aaaglgg S^SSg E? ^ 9tCtCCa 9 a 9 tggacacatc 60 

gaaagataat aaaaajct?? tgac??ccca 3X222 S^UE* 120 

gaggcctcag attcgfcaac a??cctt?aa cgagc?gga aaSStc^ a 3 * 9 * 9 ^** 180 

gcagctgagt aacaggatga tggcaccaca acalo^f 3 aa ° ttttcat 99taaactca 240 

gcattggagc caacalccgc cUSalSSo o^ff^f aa t caa 99ca ggaggtcaaa 300 

cagtctcagt ggttcagctt tSccfgcfof Ste****? 9 9^999 a gga 9aggaggctt 360 

accatgacct Iftccticag caSSIStS c t9t9cagcc tcagcccaac 420 

gcaagaaatg tctctgtctc tc?ca?cgac a?tgggtg?? E22££ 9gctcctgga 480 

attgccccca tgtgcctttt gqccaacata ocS« a 9cctggggc agaggccaac 54 0 

actcccctgg gcclccccag totH^Sta cca^f- t acc 9 a 9tccg tgtggggtcc 600 

ccagggacll gccLat?c? Scaacat? S££££Z SSS? 0 ? 0 "=9 tc «»"S 660 

ctgaatggcc atgagaagga glccatgcag ??cctqaSa SSSS " t 9 aaaa ° a cc 720 

gagaaggtgc gccagctgga gcaggalaat aSSSSS acc 9 cc t99c caactacctg 780 

agcaagtgcc acgag'tccfc WtSS JSSg? 3S£2 SSSJS !So° 
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174- 



agtttacaac 
gcagggatag 
gccagaagcc 
aattttcctt 
gtgctgcagc 
gcctaagatg 
gggtgctgga 
cttctagagc 
aatagggaat 
cactgtgttg 
gcaggtgctg 
tctcaggtgc 
cactacacag 
gctgcgagct 
ctttccctgt 
ctgtggcctg 
gctggagtcc 
cccttggtgt 
aacggaggct 
gagattgctt 
cttcagagga 
tgaagggcag 
acactctaag 
ttgacctgag 
atcgccagga 
gattggcctc 
tcagcctgca 
agttgagacg 
gtccctctgc 
ggtcagtaca 
ttggaagcac 
ggctctgatc 
agggagttag 
tggagatcca 
acagaactcc 
cctcatcaac 
ccaggagtac 
ccggaacctt 
tgtgcaaaaa 

ggggtggcat 
taatgcctgc 
tgctgaccac 
gcccagcaca 
tctttgtgct 
agttccacaa 
gaacactatg 
tcctctggcc 
cctggcacac 
ttgccccttt 
ctaaagtgtc 
ccttttttct 
tagtccagcc 
aactggatac 
cgttaggctt 
ggagactgaa 
cctatggctg 
tgttgcctcc 
cttcgcttct 
tgagagcctt 
ctcgccagag 
tgcggccaaa 
<210> 352 



atgtgctaga 
aatctcctcc 
atgtgggtac 
tcttgaagca 
ttgattaatt 
aaaacatgtg 
agggcattgt 
taaagaaagc 
ccagggtccc 
caaatgcctg 
gctgtgtgtc 
ttagtattga 
gataagccaa 
gggagtcatg 
tacaggtacg 
cgcagggtgc 
ctgaaggagg 
gatgcattga 
tccttctgct 
ctcatccaca 



gcacagggaa 
ttgggggccc 
gggtcagctg 
tagagttctg 
tgtggagcag 
catctgggca 
ggccatgtcc 
ctcggtgaac 
acgcatggga 
ggcctcgaat 
ttccctgtag 
ttaaaggtct 
attgctctct 
tgatttttaa 
ctgtgtgaag 
aatgtggagg 
caggtgctgc 
ctggagagtg 
cagataactt 
tgcagatccc 
ctaggcgtag 
gttgtgctcc 
ttcaggtttt 
catagcattt 
tagccctgtg 
aggggctgtg 
tttctgcctt 
aggtacactg 
caggcttaga 
tataatatac 
ctatgcagat 
cctgcagcct 
tgacacactc 
tgctctgcag 
gagagacagt 
gatttgtttc 
tgatctctct 
gcagcagaat 
gctgggaact 
tggttctggg 
gtg 



atgcatcatc 
acaggtgggg 
cgagtggcag 
atttcctgtg 
tgtgagtgct 
aaagtcaaag 
gttttatcat 
catttccttc 
tttgggtatg 
gttactttgc 
catcacagta 
ctgagtgggg 
ctcacaatta 
tcctggtgcc 
agagagagcg 
tagacaacct 
agctgctttg 
gactttctgg 
tgggattgaa 
gcttctggaa 
attactgggg 
tgcaggcggg 
ggagacaagc 
ggggagacgt 
tggttccaag 
ggtaggtgac 
tgctctgagg 
gccctggagg 
gacccagctt 
accaaccacg 
atgtggatgc 
accatgaagt 
ggacctggca 
cttgcttcct 
ccgaggaccg 
aacagctgtc 
tggatgtgaa 
aggactgcaa 
ctcttgggaa 
atctttggag 
gtgagggtgt 
cagttgctgg 
gctaaattct 
aaagtattgc 
aaaagcatca 
tttaaatgga 
tctctctgct 
agttctggcc 
gggacctcta 
ttttgtcttt 
ttccctgcaa 
gcgccccctg 
ctcagcccga 
gcctgctaac 
gccactctgt 
ttcccctcaa 
gagatggggc 
gagcccagtg 
ccttgagcca 
agatggcaaa 



cacctggtaa 
accctgaggc 
agctgggact 
aaactgttgc 
gataattcat 
ggttgacacc 
ctttcttcgg 
ctctcatgct 
gggcttgtag 
cacctccctt 
tcctcagtac 
aacatcctgt 
gatggcaaag 
ccttcctgcc 
ctcgctgcaa 
caccctcgcc 
cctcaagaag 
aggtgaattc 
gctttgcagg 
gctgctaggc 
aggatggaaa 
tcatactcct 
tccggataga 
gaggccagta 
cccaggtgag 
catgtttctc 
agctgcagtg 
tggagcttca 
ggctcccact 
gactcagctt 
tgagtgaatc 
gtgttagtta 
atgtcttggg 
cctttctttg 
ctaccgcaca 
tgagatccgg 
ggcccggctg 
gtacgtaccc 
tgacctaaaa 
acaaacttgc 
acaggagttt 
ctactcccat 
ctccaggaga 
taagggcatg 
gagggctgat 
atcagtggct 
gtgttccttc 
ccagctccac 
aggtctaagg 
cacatacttc 
cccgtgtgca 
ctcccgggcc 
ggtgctggga 
ctgggccttg 
ggataggtgt 
ggtgtcccaa 
aggcactttt 
cagtccagcc 
catgcttctg 
gccragtgaag 



gcaacttgtg 
ttggggtgct 
tggccttaaa 
cttaataagg 
agcagaaatc 
acctgaggag 
ggcactgtgc 
ggcaatgtgg 
agaggggcct 
ttgggccatt 
cagctgctcc 
tgctttctgg 
ggcaggccag 
ctctaactgt 
cagctggtgg 
aagtgtgacc 
aaccatgagc 
aggctcatct 
tgagctctgg 
tggtgcaggc 
gtgggagttg 
ctgatgccta 
gctggacatt 
cgaggccatg 
ggggactgct 
tatgctccag 
ctgccagtca 
ggctcagcac 
acccatgagt 
aaaccttggt 
aacattaact 
ttctggctga 
gtgtcacaat 
atattataga 
gagctggccc 
gctgacctgg 
cagaatgaga 
tgttgtctat 
ggagctctgg 
taccactagg 
tcaggtcatg 
agagggctaa 
agtattccaa 
tgcactgact 
ataccccttt 
gacatctgga 
catatgggac 
tgtggactgt 
tcctcttttg 
tcatgccaat 
accccagcct 
acccatgggc 
aaggggaggt 
cctccctggc 
ttgaatccag 
gtgctccatt 
gtttttattt 
tcagcatctc 
tagacggggg 
gctagagtta 



aagcaggcgg 
ggtgtgactt 
ttttgtacca 
atatgtttct 
atgctccata 
ggtctaccct 
aagtctcact 
cttttactct 
gcagtgattg 
gtgtgtaaag 
ccttggttca 
gctttgagca 
tggacttaca 
cactcctgca 
aggctgacat 
tggaggccca 
aggtgtgtga 

ggggg9 aaaa 

cttttggtgg 
a gcggtttat 
agagtttggg 
caggaagccc 
gagcccacca 
gtggagacca 
aagcagtggg 
tctgaaggca 
gagatcctgg 
acactggtga 
agctctctgg 
ttcaactctt 
taggtcagag 
agacagctaa 
cttaagaggt 
aggactgtct 
agatgcagag 
agcggcagaa 
ttgccacata 
gtcaacaccg 
aggccaggtg 
cctggtgagc 
gcatttgttt 
aatactgaat 
taacgagtat 
ctttcatttt 
aagcactaag 
ccattcttgt 
gtgaccctgg 
gggcacgtca 
cactttaatt 
gtttcaaaca 
tcagcactcc 
cctgctcatc 
gtcttcagct 
cagccaggca 
cagctcagcc 
ttctctcttc 
tgctattaaa 
cgtcccatgc 
ttgtggctgg 
aaaatgggac 



3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 . 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380^ 

4440^ 

4500 

4560 

4620 

4680 

4740. 

4800 

4860, 

4920; 

4980L 
5040; 

siotf 

5160; 
52201 
5280;. 
534Cf 
5400; 
5460 
5520 
5580 
5640 
570(^ 

5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6673 



<211> 1986 



<212> DNA 
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aggggcctga aaacatcatt tgcatggctg gagttgcccg cgtaaggcag ccaaaaaact 
cacccaaagc ctgtagcctc cccaactact ccagaltgtl cfgctcacc? ?£5£tecE 
gggggtctgt tccttcctat gctcacccag agaactcEct gaEgtgeeag tgggcctccc 
<2^> aC 35? Ctaataaata tcatttcott ggcaaaaaaa LaLLaal aSaaa 



1440 
1500 
1560 
1616 



<211> 6673 

<212> DNA 

<213> Homo sapiens 



<400> 351 



aagaccatgg 
gcttggggat 
aatgtttgat 
aaaaatatca 
tctcagtgag 
cttcctttta 
tctaaaatct 
acaaatatct 
actaatattt 
tgagagagac 
gctgacagaa 
ttttctctgt 
gagctcatta 
attgacataa 
tatataaggt 
gcaggatctt 
aggtttcaga 
gagtccgtgt 
acaccacttg 
cctaccgcga 
gcctggccaa 
ccaaactcca 
acttctgcac 
gtcaggaatc 
aagttttatt 
actacagtgg 
atttactatc 
tttttctttt 
atgagatgta 
taaaattcct 
eg t get gag t 
aaggaatatt 
tettatgeat 
ttcttagttt 
tagattcaga 
acagatccta 
tttttttttt 
atctcagctc 
tgagtagctg 
gagatggtgt 
tetgecttgg 
cttcttattt 
atgttcagtt 
ttccaagaag 

gggagtttgt 

tcacctgagg 
ccccagattc 
aaattggctg 
gecaagtgag 
catgtggcat 
aaagggcatt 



tcctggtatt 
aatgtctcca 
tgtgtttatt 
ttcaaaacat 
ctctcttcca 
ctgtcttcat 
ccttcttaca 
ateaaaacaa 
taggttttgc 
ttgeaggaca 
ggggaaagat 
tttccaggac 
actctttata 
atattaatga 
ctacaggaga 
gecttttage 
ggccaacgct 

ggggtcgact 

tcccttgcca 
aaacaccctg 
ctacctggag 
tgagaggagc 
catccaggag 
tgcttactgc 
ggttcatggc 
cagagttcag 
tgacccttta 
aatgtgggaa 
gaatttatca 
aaatcgtagt 
ttatgaaagt 
tcctaacatc 
attctattgt 
tggagacaag 
gaagacctgt 
ctttatctaa 
ttttgagaca 
actgeaaact 
ggattacaag 
ttcaccatgt 
tctcccaaag 
atattgggct 
tgctactgtc 
tctcttgcac 
gttgtccagg 
ctataggtgt 
tgtgcaecaa 
cagatgactt 
agttggggtt 
eggatgtcat 
aagctcaaga 



taagactttt 
ttataaatac 
ttgttctcta 
atttggaaaa 
cctcgaaatc 
aatatcaaga 
agacagcata 
acaaatcatg 
tgcattggga 
ttttaggagc 
ttcacagcca 
cctgcagtga 
ggaaacaaaa 
ggaaaatgee 
aggaagattt 
accatgactt 
gcctctctgt 
cccctgggcc 
gggacctgcc 
aacggccacg 
aaggtgcgcc 
aagtgccacg 
ctccagcaga 
ctgtcggcca 
catgetcatg 
tcattgtaat 
caaaaaaaga 
gagaatctct 
ctctaggatt 
cacaaaaatc 
tgeagtatae 
ettgecaagg 
cacctttgat 
aagacaagga 

ggggaggttg 

aacattcata 
aagtctcget 
ctgcctccca 
tgtgcgccac 
tgcccaggct 
tgeggggatt 
aagcatcata 
ttccatctcc 
atcttccttc 
eggatgetgt 
ctctgggatc 
ateggagaac 
caggaccaag 
ggatggggat 
tgacccagac 
tcctgaaaaa 



cacaacctgg 
caacacaact 
tgagagaagc 
tcccctactc 
tttgcagctg 
tcatttactt 
gatgtgcgtt 
atttttccct 
acattcagag 
ctttagttct 
cagaaagtgg 
tgggaagaaa 
accactgact 
agttttcttg 
tggtgtcaag 
ctgaccattg 
gcctcttggc 
gcctcagcct 
acattcctgg 
agaaggagac 
agctggagtg 
agtccagcgt 
aggtgaggtt 
aatccggccc 
catttgtatg 
ggagactgta 
ttactgaccc. 
tgagtgaagg 
ctgagttcat 
atgctatttt 
tgaaactttc 
cagtgtctat 
fcatccagtta 
atgtcaaaat 
gacaaecaga 
gccagtcctc 
ctgtcaccca 
ggttcaagtg 
cactcctagc 
ggtcttgaac 
acaggcatga 
tctggcaaaa 
tggttccttt 
ccacaccaca 
aggttcaact 
agtaagfccac 
aataggctgg 
tgagttgggg 
gggacaatgc 
cccaacccag 
ctatccgctc 



actcaatgea 
agactatttt 
tcattcccca 
catgaagtct 
tcttttaaaa 
ttttatcttt 
tgttccgcac 
ttcttcaatg 
gtaggaaaag 
tgattatttc 
getgettgaa 
gaggttaggg 
aatagagtga 
eaccaaeace 
cagatccaac 
cagttccctc 
taatgtggca 
ctgtctgccc 
caacategga 
catgeagtte 
ggacaatgea 
gtgecgaaac 
tgcggtgcac 
accacctgct 
ttgtctgcag 
gggtctgcaa 
tgggtcagca 
tgaatgggaa 
gcttacttaa 
catgaaaacc 

tggtgcagta 

ccacacaaca 



— —3*— w «— \~ CI 

aggggttaca 

ggaaagattc 
tgttctgtgg 
ggctggagtg 
attctcatgt 
taacttttat 
tcctgacctc 
gccactgcgt 
tgaaagcaag 
gactctatgt 
aaagcatctc 
teagecaata 
ttactgagee 
ttgtgcaaat 
ttggacaggg 
caagttggct 
tagttatagc 
tccagtggtt 



actttceggg 
ccattttcca 
accctttata 
ttttccggtc 
aaaaaaaata 
cttattaaaa 

tgggcactct 

tgcctgattg 
aatgtgggag 
atagacataa 
gaagaccttg 
cctctttcat 
ttgtataaca 
cactaaggct 

fc gtgggcaaa 

ctcagcgggc 
catgccaatc 
ccaacctgcc 
atctgtgggg 
ctaaacgaec 
gaactggaga 
taccagtcct 
cagatcaggg 
tttgeaaata 
ctgcttttgt 
agectaaagt 

gtggattttt 

tttgtaaagg 
cttcagcaca 
atgeaaaaac 
aacagggcaa 
gcatcaacct 
agagcetaca 
gaaacctggc 
tttctttaga 
cctagttttt 
cagtggtgca 
ctcagcttcc 
atttttagta 
aggtgatcca 
ccggccagat 
tctttgttat 
cttcatcctt 
ttgcacgttt 
aaacttcctc 
tactttctta 
agacaatgee 
atgggaagga 
gcttctgtct 
agaagagaac 
tagcattaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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<400> 349 



ggaaaggaaa 
cagaggaagg 
attacctgtt 
agctgctcct 
tgcaacatcc 
agcgagaagg 
cgtcagctgg 
caggagccct 
cagaagattc 
aagctggcct 
gtggagtcgg 
gacctggagt 
gaggaggagg 
actgccccca 
ctggtggaaa 
aacaagcagg 
ctgagacgca 
tctctggaaa 
cagagcctga 
cagaaccagg 
acgtaccgga 
accaatgcta 
tgaaaagctt 
gatcctcaag 
tatgttattt 
cttccaatgt 
ttcagctccc 
gcatttcctg 
tacttcccaa 
<210> 350 



ctatgctgta 
agagactcaa 
ccagcaccat 
cccggccctg 
ccgccaatgt 
agactatgca 
agcgggacaa 
tgctgtgccc 
tgtgtgccaa 
ctgacgactt 
acatcaacag 
cccaggtgga 
ttaacaccct 
ctgtggacct 
ttaaccgcag 
tggtatccag 
cagtcaacgc 
acacgctgac 
tcaccaacgt 
agtaccaggt 
gcctcctgga 
gtggcaactc 
gtatcctctt 
tcctgacttt 
ctagaatgct 
ctggacaact 
tgtaatctga 
ttttcctttt 
taaacttcat 



tgccaagccc 
acctgcccac 
gtcttacagt 
cgtgcccccc 
gagcaactgc 
gttcctgaac 
cgcggagctg 
cagctaccag 
ggctgagaat 
cagaagcaag 
catacgcagg 
gtccctgagg 
gcgctcccag 
gaaccaggtc 
ggaagtggag 
ctcagagcag 
cctggagatc 
ggagagcgag 
ggagtctcag 
gctgctggac 
gagtgaggac 
ctgtggaccc 
tgaagacatc 
tctggagctc 
gaaaagcttt 
ccttcctgtt 
gctccagtta 
ttctttggca 
ttctctggca 



ccacccacaa 
atccacctcc 
tgttgcctgc 
agctgccacg 
aactggttct 
gaccgcctgg 
gagaaactca 
tcctacttca 
gccaggctgg 
taccagacgg 
atcctggatg 
gaggagctga 
cttggagacc 
ctgaacgaga 
caatggttcg 
ctgcagtcct 
gagctgcagg 
gcccactaca 
ctggcagaga 
gtgcgtgccc 
tgcaagctcc 
tgtggcacct 
tacaaagcca 
agctgacatc 
cctgacccaa 
gagggtcgag 
ctacttaagg 
ttctctggaa 
taa 



ttaatggtat 
agcagcttac 
ccagcctggg 
gctacaccct 
gtgagggctc 
ccagctacct 
tccaggagcg 
agaccattga 
tggtgaacat 
agcagtccct 
agctgaccct 
tctgcttgaa 
gcctcaacgt 
ccaggagtca 
ccacgcagac 
gccaggcgga 
cccagcacaa 
gctcccagct 
fcccgctgtga 
ggctggagtg 
cctgcaaccc 
ctcaaaaggg 
tttagatcaa 
aagaaacctc 
gcaaagacac 
cctgtttgtt 
tgtttcctga 
tgcaaggagg 



aaaaggactg 60 

ctgcttttcc 120 

ctgccgcacc 180 

gcctggggcc 240 

cttcaatggc 300 

ggagaaggtg 360 

gtcccagcag 420 

ggagctccag 480 

tgacaatgcc 540 

gaggctgttg 600 

ctgcaagtct 660 

gaagaaccat 720 

ggaggtggac 780 

gtatgaggct 84 0 

cgaggagctg 900 

gatcatcgag 960 

cctgcgagac 1020 

gtcccaggtg 1080 

cctggagcgg 114 Og 

tgagatcaac 12001 

atgcgccacc 1260 

ttgctgtaat .1320 

ccacaggaag 1380 

atcttgcctc .1440 

acatcatcaa TL500 

tctaaagatg . r 1560 

acgtactact il62 0 

agacttcatt 1680. 

1713 



<211> 1616 

<212> DNA 

<213> Homo sapiens 



c400> 350 

gagaatttag 

acaacttctg 

cccccagctg 

actgcaactg 

tgaacgaccg 

agctggagaa 

accagtccta 

agaatgccag 

ccaagtacca 

gcaggatcct 

tgaaggagga 

gccagcttgg 

gggtgctgaa 

tggagcaatg 

agcagctgca 

agatcgagct 

gtgaggcccg 

cccagctggc 

tggatgtgcg 

aggactgcaa 

gaccctgtct 

ccccacgccc 

gaggagcaag 



actctgtctt 

cctgcccagc 

ccacagctgc 

gttctgcgag 

cctggccagc 

cctcatccgg 

ttttaagacc 

gcttgtggtg 

gaccgagctg 

ggatgagctg 

gctgctctgc 

agaccgcctc 

cgagaccagg 

gttcaccacg 

gtcctaccag 

gcaggcccag 

ctacagctcc 

ggagatccgc 

tgcccggctg 

tctgcccagc 

ctccaatccc 

ccgctgtggg 

gatgcagggc 



cagccaggca 

ctgagctgcc 

accctgcccg 

ggctccttca 

tacctggaga 

gagcggtctc 

attgaggagc 

cagatcgaca 

tccctgcggc 

accctgtgca 

ctcaagagca 

aatgtggagg 

agtcagtatg 

cagaccgagg 

gcggagatca 

cacaacctgc 

cagctgtccc 

agtgacctgg 

gagtgtgaga 

aatccctgtg 

tgtacctctt 

ccctgcaatt 

ccaggactcc 



ctccctccct 

gcaccagctg 

gggcctgcaa 

atggtagcga 

aagtgcgtca 

agcagcagga 

tccagcagaa 

acgccaagct 

agctggtgga 

agtccgacct 

accatgagca 

tggatgctgc 

aggccctggt 

agctgaacaa 

tcgagctgag 

gagactctct 

aggtgcagag 

agcggcagaa 

tcaacacata 

ccacgaccaa 

gtgtccctcc 

ccttcgtgcg 

agagctgtga 



ccctcccagc 

ctcctcccgg 

catccccgcc 

gaaggagact 

gctggagcgg 

gcccttgctg 

gatcctgtgt 

ggctgcggat 

gtcggacatc 

ggaggcccag 

ggaggtcaat 

tcccactgtg 

ggaaaccaac 

gcaggtggta 

acgcacagtc 

ggaaaacacg 

cctgatcacc 

ccaggagtac 

ccggagcctg 

cgcgtgcagc 

tgccccctgc 

ctagaaccta 

cctggctctg 



actatgccct 6^ 
ccctgcgtgc 

aatgtgagca 180 

atgcagttcc 240 

gacaacgcgg 30 0 

tgccccagtt 36 0 

accaagtctg 420 

gatttcagaa 480 

aacggtctgc 54 0 

gtggagtccc 600 

accctgcgct 660 

gacctgaatc 720 

cgcagggaag 780 

tccagctcag 840 

aacgccctgg 900 

ctgacagaga 960 

aacgtggagt 1020 

caggtgctgc 1080 

ctggagagcg 1140 

aagcccatcg 1200 

acaccctgtg 1260 

gggaatgcca 1320 

gttcaacaaa 1380 
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sees ssss sees sees 3SSE SEES e 

I S SUE SSSS SSSS SSSS £S 

gaccattgag gagctccagc agaagatcct gtgcagcaag tctgagaatg ccaqocttat 
ggtgcagatc gacaatgcca agctggcctc agltglcttl aggfc?aaa? ItgSScIa 
gctgtccctg cggcagctgg tggagtcgga catcaatggc cfLgcagga tcISEatX 
SSEf* tgca39tct 9 ^cctggaggc ccaggtgflg tc?c?g2£ aggagSgc? 
SSSS" cagaaccat 9 agcaggaggt taacaccct£ cgctgScag? 52E25 
cctcaacgtg gaggtggacg ctgctcccac tgtggacctg aacciggtcc tgllSaoac 
caggagtcag tatgaggccc tggtggaaac caaccgcagg gaagtSgagc StSgttSc 
cacgcagacc gaggagctga acaagcaggt ggtatccap IcggagSalc tocaqtccfa 
ccaggcggag atcatcgagc tgagacgcac ggtcaatglc ctgagaSg afcScaSqc 
ccagcacaac ctgcgagact ctctggaaaa cacgctgaca gaSIgSgagl c?cgfSS£ 

ssss see sssss ssss S 
ssss ss= ssss ssss s I 

S" 9 !?e ctac 9 t 9<=" sotgtgsgco ttgcaaSac. eteS£S j* atctetM 



<211> 1621 

<212> DNA 

<213> Homo sapiens 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1252 



<400> 348 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



ip ess 55 ins SSI =~~ h. 

i§S ips SIS 53S SSS iES 
H 35 ISS 255 S5S ISSS 

iiiii&liiiSiss 

iSiiiiiiiiiisi i 

ssss sssss ssSs: »sS ££3 35H ss 

iillflislllllliilt i 
pss hss sss sis iH iiii I 

<210> 349 1621 
<211> 1713 
<212> DNA 



<213> Homo sapiens 



BHC03 1 051 

- 170 - 

<212> DNA 

<213> Homo sapiens 

<400> 345 



atg?ctggca gttgctcttc taggaaatgc ttctccgtgc cagccacctc tctctgctcc 
l^ltlatll gctgtggagg ccccatctgc ctgcccagtt cctgccagag ccagacatgg 
™«SS2£ IttgtSaga cagctgtgga tcatccagct gtgggccaca gtgccgtcag 
ccctcc?g?c ctg?gagtag ctgtglccla cccctgtgct gtgatcctgt catttgtgag 
ScttcttSct ccgtgagcag cggctgccaa cccgtgtgct gtgaggccac cacctgtgag 
ccttcttgct c?gtgagcal clgctaccaa cctgtgtgct tcgaggccac catctgtgag 
ccttcttgct cagtgagcaa ctgctgccaa cctgtgtgct ttgaggccac cgtttgtgag 
cct^cttltt ccgtgagcag ctgtgltcaa cctgtgtgct gtgagcctgc tatttgtgag 
ccttc"gct ccgtgagcal ctlclgccag cctgtaggct ctgaagccac "cctgccaa 
ccaatcctct gtatgcccac ttcctgccag cctgtcctct gcaaatccag ctgctgccag 
ccagt££ct Sgagcccag ctgctgttca gctgtctgca ccctgcctag ttcctgccaa 
cc?gtggtct gtgagccttc ctgctgtcag ccggtgtgcc cgacacctac ^^tctgtg 
accag?lgct gccaggctgt ctgctgtgac cccagccctt ggtcttctgc atctgccatc 
^tntlc^ll Ittatcctag gactttctac atacccagtt ccagcaaacg gccttgcagt 
Inttllttll ccKccgccl ggtctcccgt ccgatctgcc gcccaatctg ctctggactc 
c?cacSXta ggcagcca?a Stgacatcc atltcctacc gtcctgcctg ctatcgccca 
g SS ?cctgcgccg ccclgcctgt gtcacttcct actcttgccg cccagtctac 
ttocacccat cttgcactga gtctgactct tgcaaacggg attgcaaaaa atccactccc 
aaccaactgg attgtgttga lacaaccccc tgcaaggtgg atgtctcaga agaggctccc 
?gccagcccl ctgaagccla acccatcagc ccaaccaccc gtgaggccgc agcagctcag 



^ w — — " * 

cctgctgcca gcaagcctgc caactgctaa 
<210> 346 

<211> 815 

<212> DNA 

<213> Homo sapiens 



<400> 346 

gaactccggc tgcgttaccg acctgcgagc 
ggggactgct tcacctgctg cacccaggag 
ccgggctgct gtggctgctg cggctcctgc 
tctggctgcg ggggcagctg ctgcggatcg 
ggaggctgcg gaggctgcgg gggtggctgc 
tgcggctccg ggtgctgtgg gcctgtgtgc 
tgaagacctt tccctccacc actgatgcag 
ggacagcccc tcgtgtccag aaccttccat 
atttgtagag gagggcttgt tctccaacac 
agagccatac tctgatgaaa cattaaaact 
ggtgaatccc caatacttta cttattcaca 
cctcttcttc aaggtgtctt tgggactgat 
tgcaaaaaag gtccatcttc tttgtgagcc 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
<210> 347 

<211> 1252 

<212> DNA 

<213> Homo sapiens 



tgacgggccc ggaccatggg gtgctgcccg 
caaaactgct gtgaagagtg ctgctgtcag 
tgtggctgtg ggggctctgg ctgcgggggc 
tcttgctgtg gatctggctg cggaggctgt 
tgtggatcca gttgctgtgg gtccagttgc 
tgccagccca cacctatatg cgacacaaaa 
tcccaccgaa agcctccatc tgctccaggg 
acccccaaga cagtgtcttg tctttctgtg 
cttctctggt atttcaaggc accgagaaca 
cggtcacaac taagtgatcc caagctcaag 
tgaagttcaa tgtcttgtac tataagacat 
cctccgccct ggctttctgc agctttgaga 
tcttaataaa tttgagcatg ctggcataaa 
aaaaa 



ggactctgJc ttcagctgga cactccctcc ctgcaccatg tcttacagtt ^ggcctgcc 
cagcctglgc tgccgcacca gctgctcctc ccggccctgt gtgcccccca gctgccacgg 
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gctgccagcc 
tccagcccac 
gaacaaccat 
ttaccttact 
ttaaaatctt 
ttataccttg 
ctttgactca 
tgattaagaa 
cttccttctt 
atttcttaat 
<210> 343 

<211> 990 



ctgctgccgc 
ctgtgtgtcc 
cttcacacaa 
gctgacagca 
gtgaatcagc 
tgaatcatgt 
aaagtcaaga 
tcttcacaac 
ttaatgatca 
aaataaattt 



cccgcctgct 
agctgctgcc 
caaccttctg 
accatgttct 
ttgagggagg 
gccagcttca 
gcttcattct 
tatgttttct 
ctttgggtta 
ggaactatta 



gtgagaccac 
agccttcttg 
ctcaactgac 
cacccaaatt 
gcagaatact 
tctgttctca 
cttcacttaa 
tttcaatata 
tctccctata 
ttcataaaaa 



ctgctgcagg 
ctgctgatca 
ttatcttttg 
tttatgaatt 
tcatcctgat 
attttgagtc 
gaaacttaag 
ctcatgattc 
accagggatc 
aaaaaaaaaa 



accacttgct 
agtcccaaga 
gaggactaat 
ctctgcatat 
tctctttttc 
atggtctcag 
ttgctgcaaa 
ttgtatcctg 
ttacctatat 
aaa 



420 
480 
540 
600 
660 
720 
780 
840 
900 
953 



<212> DNA 

<213> Homo sapiens 



<400> 343 

tctgaacaga 

tcagcctaca 

gaccacctgc 

gccttgctgc 

ctgctgccag 

ctgtgaccag 

aactgtctgc 

ctgccgccca 

tcagcctttt 

tgctcaactg 

ctcacccaaa 

gggcagaata 

tcgtgtgttc 

tctctgcttc 

ttgttttcaa 

gttatgtttc 

atcctcaaaa 

<210> 344 



agcccaccct 

tgctgcagga 

agcagcacac 

cgcccaactt 

ccttcctgct 

agcagctcct 

ctgcctggtt 

gcctgctgtg 

tgctgctgat 

acttatcttt 

tttttatgaa 

cttcatcctg 

tcaattttga 

taaggaattt 

tatcctcctc 

tgctaccagc 

aaaaaaaaaa 



ctacccctga 

ccacctgctg 

cctgctgcca 

gctgtcaaaa 

gcagcacacc 

gtgcacctgt 

gcctaaacca 

agaccacttg 

caagtcccaa 

tgggggacta 

ttctctacct 

attctctttt 

gtcatggtct 

aggtttctgc 

atggttcttg 

agagattctt 

aaaaaaaaaa 



caccatgacc 

caggaccacc 

gccctcctgc 

cacctgctgc 

ctgctgccag 

gtactgcaga 

gagctgtggc 

cttccagccc 

gagaaccacc 

atttaatttg 

gtttaaaatc 

tccttacact 

cagctttgac 

aactgatcaa 

tatccttctt 

agctatatgt 



cactgttgct 

tgctggaagc 

tgtgtttcca 

cagcccacct 

cccacctgct 

agaacctgct 

tccaactgct 

acctgtgtgt 

atcctcacac 

ctgctgacag 

ttgggaatct 

ttgtggatca 

tcaaaagtca 

taatctttgc 

tcttcttttc 

ttctgaataa 



ccccttgctg 

ccaccactgt 

gctgctgcca 

gtgtgaccag 

gtgggtccag 

actaccccac 

gccagccctg 

ccagctgctg 

aacaactttc 

ccaccatgct 

acttgaggga 

tgtgccagct 

agagcttcat 

aatcatattt 

ataactttgg 

actctgaacc 



<211> 725 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
990 



<400> 344 



actttggagg ccaggtgtat aaaaggtcca gattgcaaga qatcatca^ m-^k * 

■lis ass ~=~ ssl ill IP s 

acc«ct sg o ,ac=«o C9C =ctcctctt 9 9 a 99 «Xg 52£S £. 

<210> 345 725 



<211> 1230 
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<211> 953 
<212> DNA 
<213> Homo sapiens 



<400> 340 _ _ £n 

tcacctctta acagaagccc accctccatc cctgacacca tgacccactg ttgctcccct 60 

tgctgtcagc ctacctgctg caggaccacc tgctggcagc ccaccactgt gaccacctgc 120 

agcagcacac cctgctgtca gccctcctgc tgtgtttcca gctgctgcca gccttgctgc 180 

cacccaactt gctgtcaaaa cacctgctgt aggaccacct gctgccagcc catctgtgtg 240 

accagctgct gccagccttc ctgctgtagc acaccctgct gccagcccac atgctgtggg 3 00 

tccagctgtg gtcagagcag ctcctgtgca cctgtgtact gcagaagaac ctgctaccac 360 

cccacaagtg tttgtctgcc tggttgccta aaccagagct gtggctccaa ctgctgccag 420 

ccctgctgcc gcccagcctg ctgtgagacc acctgctgca ggaccacttg tttccagccc 480 

acctgtgtgt acagctgctg ccagccttct tgctgctaat caactcccaa gagaactacc 540 

atcctcacac aacaaccttc agctcaactg acttgtcttt tgagggacta atttactttg 600 

ctgctgacag ccaccatgct ctcacccaaa tttttatgaa ttctctacat gtttaaaatc 660 

ttgggaatct gcttgaggga gggcagaata cttcatcctc attccctctt tccttacacc 720 

ttgtggatca Egtgccagct tcgtctgttc ttaatttgga gtcatgatct cagctttgtc 7 80 

tcaaaaatca agagcttcat tctttgcttc taaggaattt aggtttctgc aactgatcaa 840 

tcatctttgc aattatattt tcattttaaa tatccttctc atggttcttg tatccttctt 900 

tcttcttttc acgataactt tgggttatgt ctctggtagc agagattctt acctatatgt 960 

ttctgaataa actctgaacc atcttcatct caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 
<210> 341 I 

<211> 1007 

<212> DNA 

<213> Homo sapiens 



<400> 341 _ 

tctgaacaga actccaccct ctacccctga caccatgacc cactgttgtt ccccttgctg, 60 

tcagcctacg tgctgcagga ccacctgctg gaagcccacc actgtgacca cctgcagcag 120 

cacaccctgc tgccagccct cctgctgtgt gtccagctgc tgccagcctt gctgccgccc.? 180 

aacttgctgt caaaacacct gctgccagcc catctgtgtg accagctgct gccagccttcy 240 

ctgctgcagc acaccctgct gtcagcccac ctgctgtggc caaaccagct gtgggtccag^ 3 00 

ctgtggtcag agcagctcct gtgcacctgt gtactgcaga agaacctgct accaccccac: 360 

gactgtctgc ctgcctggtt gcctaaacca gagctgtggc tccagctgct gccagccctg.? 420 

ctgccgccca gcctgctgtg agaccacctg ctgcaggaca acttgcttcc agcccacctg 480 

tgtgtacagc tgctgccagc cttcttgctg ctgatcaagt cccaagagaa ccaccatact 540 

cacacaacaa atttctgctc aactgactca tcttttgggg gactaattta atttgctgct 600 

gacagccacc atgctttcac ccaaattttt atgaattctc tgcatgttta aaatcttgtg 660 

aatcagcctg agggagggca gaatacttca tcttgattct ctttttcctt acaccttgtg 72a 

gatcatgtgc cagcttcgtc tgttctcaat ttggaatcat gatctcaact ttgactcaaa 78f 

agtcaagagg ttcattctct gcttctaagg aatataggtt tctgcaaccg accaataatt 840 

tttgcaatca catttttgtt ttcaatatcc tcctcatggt tcttgtattc ttctttattc 900 

ttttcatgat aactttgagt tatgtccttg gtaacagaga ttcttaccta tatatttctg 960 

aataaactct taaccatcct catctcaaaa aaaaaaaaaa aaaaaaa 1007 
<210> 342 



<400> 342 

ccatgaccca ctgttgctcc ccttgctgtc agcctacctg ctgcaggacc acctgctgca 60 

ggaccacctg ctggaagccc accactgtga ccacctgcag cagcacaccc tgctgccagc 120 

cctcctgctg tgtgtctagc tgctgccagc cttgctgccg cccagcttgc tgtcaaaaca 180 

cctgctgcag gaccacctgc tgccagccca cctgtctgtc cagctgctgt ggccaaacca 240 

gctgtgggtc cagctgtggc cagagcagct cctgtgcacc tgtgtactgc agaagaacct 300 

gctactaccc gacgactgtc tgcctgcctg gttgcctcaa ccagagctgt ggatccagct 360 
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ttattaacca 
acaacctcag 
actatgttgc 
ccccatgttc 
cctctatctt 
aa 

<210> 338 



gcattcttga 
ttccaaccaa 
taccctctac 
cctgaattga 
aaataaattt 



<211> 754 
<212> DNA 
<213> Homo sapiens 



tatgatccac 
ttcttggatt 
caaatgaata 
aatatttgct 
taatttttga 



ctgtgaactg 
gagtttggcc 
caagtttgaa 
ctctaccata 
ggcatgcaaa 



aatcatgcaa 
tccaaatatg 
ttttctctga 
atttatcaca 
taaaaaaaaa 



ggccaattgg 
ctcaccaaac 
aatatgtcaa 
tggagctatt 
aaaaaaaaaa 



540 
600 
660 
720 
760 
782 



<400> 338 

^^f^P aac ^ taccta gaacctccac cctctgacac catggttaac tcttgttgtg 60 

gctctgtctg ctctgaccag ggctgtgatc aaggcctctg ccaagagacc tgctgccgcc 120 

ccagctgctg ccagaccacc tgttgctgcc ccagctgtgt tgtatccagc tgctgccgcc 180 

catcctgctc tcagactacc tgctgccaga ccacttgctg tcgccccagc tgctgccgcc 240 

tcasaccacc tgccgcccca gctgtggtgt gtccagctgc tgccgtccac 300 

tctgttgtca gaccacctgc cgccccagct gtggtgtgtc cagctgctgc cgtccactct 360 

Cac ^ 9Ct 9 c cgtacaactt gctgccgccc cagctgctgt ggatcctctt 420 

atattggact atcaaccatg agccagtcac catcccatga tatgaaagtg 480 

tcatttatat caatttgtcc atgtgttaaa tggccatcag ctttattatc ctgttattcc 540 

actaagtaac tggtgaaagg gcatccatct acatttaata ccactcattt ttcccatgga 600 

9C f5 tC f?f C ctca 9 at 9<^ tgatgcagtt agactaagcc tctaagtctc 660 

^l^tttg accttcaaaa ttacatatta actgttcttc aataaatatt 720 

atcttaatat cagtaaaaaa aaaaaaaaaa aaaa n zzA 

<210> 339 754 

<211> 1042 

<212> DNA 

<213> Homo sapiens 



<400> 339 

tgacaccatg acccactgtt gctccccttg ctgtcagcct acctgctgca ggaccacctg 60 

ctgcaggacc acctgctgga agcccaccac tgtgaccacc tgcagcagca caccctgctg 120 

tgctgtgtgt ccagctgctg ccagccttgc t^ccicccaa cttgct^tcl III 

aaacacctgc tgtaggacca cctgctgcca gcccacctgt gtgaccagct gctgccagcc 240 

ttcctgctgc agcacaccct gctgccagcc cacctgctgt gggtccagct gctgtqqcca 3 00 

ESS* ? 9gtcca ^ ct 9tggccagag cagctcct^t Kcctgfcgi Icgcagaag Hi 

aacctgctac taccccacaa ctatctocct- n C ^rr a ^^^ ~ „„ * y ° u 

ttSc^cc 9C Ca9CCC ^t gcSgcccagc ctgcgtgag acca^t kittle ll°o 

ttgcttccag cccacctgtg tgtccagctg ctgccagcct tettgetget gatcaegtte 540 

"S a cc accatcctca cacaacaaat ttctgctcaa ctgactcatc ttttggggga 600 

ctaatttaat ttgctgctga cagccaccat gctctcaccc aaatttttat gaattctcta lea 

catgtttaaa atcttggaaa tetgettgag ggagggcaga atacttcatc ctgattctct 720 

^ctcao^^ acc "? t ^ tcatgtgcca gcttcatctg ttctcaagtt tglgtcatgg 78? 

tctcagcttt gactctaaag tcaagagctt cattccctgc ttctaaggaa tttaggtttc 840 

tgcaactgat cgatgatctt tgcaatcttt tttttgtttt caatatlltc ctcaESgttc Ito 

ttgtatcctt ctttcttctt ttcatgataa atttgtgttg tgtccctggt agcagaaatc l €Q 

aaaaaaaaaa aaaaaaal** taaattct 9 a accatcctca tltcaaaafa afaaaaaaaa xSSS 
aaaaaaaaaa aaaaaaaaaa aa 

<210> 340 1042 
<21l> 1020 
<212> DNA 



<213> Homo sapiens 
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ctccttcctg gaaacccacc cagaacctcc accctctgac accatggtca actcctgttg 60 

tggctctgtl ?gctctgacc agggctgtgg cctagagaac tgctgccgtc ccagctactg 120 

ccagaccacc tgctgcagga ccacctgctg ccgccccagc tgctgccgcc ccagctgctg 180 

cajgccccag tgctgcclgt ctgtgtgctg ccagcccacc tgctgccgcc ccagctactg 240 

tatotccaoc tactgcagac cccagtgctg ccagaccacc cgctgcagaa ccacctgctg 300 

ccgccccagc gctgS?gt ccaggtgcta caggccccat tgtggccagt <=tctatgctg 360 

ctaqcccatc tgctlgcaaa ccacctgcta caggaccacc tgctgccacc ccagctattg 420 

caStccaqc tgctgccagc cttcctgcag tatctctagc agcagtagct cctcctgctg 480 

tggcctcagc tgctgcaggc tctcctgctg catctccagt tgctgccgcc ccaactgctg 540 

ccaaaccatg tgctgccgcc caacctgctc tagtgcttct tgctgctgag gctgtcatct 600 

qgacccaccl qactctcatc aaccagcatt cttgatgtag ctcatctatg agctgagtta 660 

?gggaagcta Ictggaaaac ttcagltcca accaattctt agattgaatc tggcctccaa 720 

atltatactc cccccacatt ttacctctct accaaatgaa cataagttga atttgctctg 780 

aaatctgtca acStcttaa ttgaaatatt tgctctctgc cataatttct catatggagc 840 

tattccattt taaacaaata tttatctaaa taaatcttaa ataaattttc aggcatagaa 900 



ataaaaaaaa aaaaaaaaaa aaaa 
<210> 336 

<211> 1099 

<212> DNA 

<213> Homo sapiens 



cttcctggaa acccacccag aacctccacc ctctgacacc atggtcaact cctgttgtgg 
etctotolqc tctgaccagg gctgtggcct agagaactgc tgccgtccca gctactgcca 
gaccacccgc £gcag^acca cctgc?lccg ccccagctgc tgtgtgtcca |ctgctgcag 
Iccccagtgc tgccagacca cctgctgcag gaccacctgc tgccacccca gctgctgtgt 
Sccagctgc tfcaglcccc agtgctgcca gtctgtgtgc tgccagccca cctgctgcag 
IcccSatgc £gcc!gacta cltgctgtag gaccacctgc tgccgcccca 3ctgctgcag 
qccccaqtlc tgccagtctg tgtgctgcca gcccacctgc tgctgcccca gctactgtgt 
acccaqctgc ScagacccI agtgctgcca gaccacctgc tgcagaacca cctgctgccg 
ccccagctgc t^tgcgtcca ggtgctlcag gccccattgt ggccagtctc tatgctg.cta 
qcccacctgc tggcaaacca cctgctacag gaccacctgc tgccacccca 3^attgcat 
?tccagctgc tgccagcctt cctgcagtat ctctagcagc agtagctcct cctgctgtgg 
"tcagctlc tgcaggctct cctgctgcat ctccagttgc tgccgcccca actgctgcca 
gaccatgtgc tfccgcccaa cctgctctag tgcttcttgc tgctgaggct 9tcatctgga 
ctcacclglt tctcatcaac cagcattctt gatgtagctc atctatgagc tgagttatgg 
aaaqctaltt qgaaaacttc agctccaacc aattcttaga ttgaatctgg cctccaaata 
cacactcccc ccacttttta cctctctacc aaatgaacat aagttgtaat ttgctctgaa 
a?c?gtcaac ca?c?taatt gaaatatttg. ctctctgcca taatttctca tatggagcta 
ttccatttta aacaaatatt tatctaaata aatcttaaat aaattttcag gcatagaaaa 



aaaaaaaaaa aaaaaaaaa 
<210> 337 

<211> 782 

<212> DNA 

<213> Homo sapiens 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
780 
840 
900 
960^ 
102U 
1080 

1099 



ccaccttcct ggaaatccac ccagaacctc caccctctga caccatggtc *actcctgtt 60 

gtggctctgt glgctctgac cagggctgtg gcctagagaa ctgctgccgt «cagctqct 120 

accloaocac ctgctgcagg accacctgct gccgccccag ctgctgtgtg tccagctgct 180 

gcagaccgca gtgctgccag tctgtgtgct gccagcccac ctgctgcagc "cagctgct 240 

occlqaccac ttgctgcagg accacctgct gccgtcccag ctgctgtgtg tccagctgct 300 

JcSacccca qtgct?ccag tctgtgtgct gccagcccac ctgctgccgc cccagctgtg 360 

qcSqaccac ctgctlcagg accacctgct accgccccag ctgctgtgtg tccacctgct 420 

gccgcccaac ctlctctag? ggctcttgct gctgatgccc tcacctatac tcacctgcct 480 
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<210> 
<211> 
<212> 
<213> 
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333 

1077 

DNA 

Homo sapiens 



<400> 333 
ctcactctcc 

tgtggctctg 

tgccagacca 

tgcaggcccc 

tgtcagacca 

tgcagacccc 

tgtcagacca 

tgcagacccc 

tgcatctcca 

tgcctgcgtc 

gtcatctcca 

gtgaacacac 

tgcggactgt 

tgtatttgca 

ctcctaataa 

ttattcatca 

taaagcctcc 

ttgtaccttg 

<210> 334 



tggaaaccca 

tgtgctctga 

cctgctgcag 

agtgctgcca 

cctgctgtag 

agtgctgcca 

cctgctgcag 

agtgctgcca 

gcagctgctg 

cagtctgtgg 

cctgcccccg 

cacttcctta 

tcaagagaat 

ttctaccttt 

attgacaaat 

tgcctaagga 

aaccttgaag 

ttatttccat 



ccgagaacct 

ccagggctgt 

gaccacctgc 

gtctgtgtgc 

gaccacctgc 

gtctgtgtgc 

gaccacttgc 

gtctgtgtgc 

cccctcttgc 

ccgagtctcc 

ccccttgtgc 

ttacgtcctt 

tgatctgggt 

tgtccaaact 

tgtcctccaa 

atttgaagat 

tccagtgaag 

gtaccaaaat 



ccaccctctg 

ggcctggaga 

tgccgcccca 

tgtcagccca 

tgccgcccca 

tgccagccca 

tgccgcccca 

tgccagccca 

tgtgaatcca 

tgccacacca 

tgtgcctcct 

tcctacagat 

cccataagca 

cccttccttc 

catcctccca 

tgcctccatc 

tctctctctt 

aaacctctat 



acaccatggt 

actgctgccg 

gctgctgtgt 

cctgctgccg 

gctgctgtgt 

cctgctgccg 

gctgctgtgt 

cctgctgccg 

gctgctgccg 

cttgctatcg 

cttgctgcta 

gaaggctctc 

aacctcatcc 

caaaggaatt 

cctctttgac 

atttgtaggg 

aaagtctttt 

tctattggca 



caactcctgt 

ccccagctgc 

gtccagctgc 

ccccagctgc 

gtccagctgc 

ccccagctgc 

gtccagctgc 

ccccagctgc 

cccctgctgc 

cccaacctgt 

aatctctgct 

attgcaaaca 

ttagaaattc 

cattgacaat 

ttcaggacat 

ccacagatct 

gcaaacattt 

ctgaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1077 



<211> 942 
<212> DNA 
<213> Homo sapiens 



<400> 334 

tagaaatcca 

tcagctctga 

cctgctgcag 

tgtgctacca 

attgctgtga 

cctgctgcag 

gctgctgcag 

gctgctgcat 

gctgctgcgt 

cctgtgtcat 

tccttgtgat 

cagccagtca 

tggacattca 

aggccccaat 

tgatgatctc 

aaaaaaaaaa 

<210> 335 



cccagaacct 

gcagagctgt 

gaccacctgc 

gcccacctgc 

atccagctgc 

gaccacctgc 

accccagtgc 

ctccagctgc 

gcgtccagtc 

ctccacctgc 

atttgtcata 

ctggaaaaat 

gctcttctag 

gcagttattt 

atataatatt 

aaaaaaaaaa 



ccaccctctg 

ggcctggaga 

tgccgcccca 

tgccacccta 

tgccgcccct 

tgcaggacca 

tgccagtctg 

tgccacccct 

tgtggccgag 

ccccgcccct 

ctatgaatgt 

gaacacttcc 

gaaatgacag 

ttagatgagc 

gttttcatgt 

aaaaaaaaaa 



acaccatggt 
actgctgccg 
gctgctgcaa 
gctgctgcat 
gctgctgccg 
cctgctgctg 
tgtgctgcca 



tctcctgcca 
tgtgctgtgc 
cttcattagt 
ctgccagttt 
acaatcacat 
agtgtcttca 
attaataaac 
aaaaaaaaaa 



cagctcctgt 

ccccagctgc 

gccccagtgc 

ctccagctgc 

ccccagctgc 

ccccagctgc 

gcccacttgc 

atccagctgc 

caccacttgc 

ctcctcttgc 

catttaaaat 

gtctcatgtg 

tcattaaaat 

ttcgaagggg 

agccacttcc 

aa 



tgtggctctg 

tgccagacca 

tgccagtctg 

tgccacccct 

tgccagacca 

tgtgtgtcca 

tgccgtccca 

tgccgcccct 

tatcgcccaa 

tgctaatgtc 

gcactgtagc 

gcattcagag 

atgttatgcc 

acactaactg 

ctaaaaaaaa 



<211> 924 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
942 
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caccagtcta 

gagacttggg 

tcagacatca 

gtatgatagg 

gaggcaggcc 

agccaagaat 

ctgggagcac 

ctagtcagtg 

taaccctccc 

cagggtgagg 

cagaaggcca 

aagctggcct 

ggtgaggacc 

gcttcaacac 

tggaggagag 

ctgtctcttt 

gtaatccatt 

ttagagcttc 

ctccactccg 

gactgggttc 

agactctaaa 

agtcatgtac 

cattagatta 

aatacctacc 

tttgtagctt 

aggctgcata 

taaaacttac 

tagtgtattt 

caaaagattg 

ttgggaggcc 

atggtgaaac 

tgtaatccca 

gttgcagtga 

tctcaaaaat 

tgtagtaggc 

aaaatcattt 

tgtctgccta 

gcaggaagga 

tctgagcaga 

gtgtcagcag 

cggtgacggg 

tgtgtgcgcc 

cccactttgt 

gctcccacgg 

cccacgccgc 

gcacagggct 

ccctaggtaa 

gcctggagcc 

tgcaggacag 

tctcagcatt 

ctcttgccca 

ggtgggaacc 

acctcagcat 

ccaacacatc 

cacagagacc 

catttcccca 

c 

<210> 366 



aggggcagga 
aaaggcattt 
atggggttag 
gactatggta 
atgcagaaat 
ttcaattatt 
taatttgcaa 
gtcagtggtt 
ttgcctccca 
cggccctcag 
agcaggacat 
ggactttgag 
acagctgtgg 
ccaggagcat 
aatttgcagg 
gctttcccta 
ttgtgggtcc 
agatagtaaa 
ttgtcatgta 
caagaaaact 
aggaaagatg 
cacataagga 
taccatattt 
atggtgttac 
agaagcaaca 
cagcccagga 
gagattttcg 
tatgtgtggc 
gacacccttg 
aaggcaggtg 
cccgtctcta 
gctactaggg 
gccgagatcg 
aaaaataaaa 
tacaacatgt 
aatgctgtat 
tagagaggac 
gtgcccatgt 
aggggtctct 
ctcccgcggc 
cagcgtctgc 
ctgcggccag 
accgacgccg 
tccccagacg 
gccttcctgt 
cctagcccgc 
agaccccggt 
ccttggtgtg 
gccacctctc 
tccagtaaag 
cttggtcttc 
ccatcactgt 
ttgaaagttc 
ccacatagct 
tccctgtggc 
tgcataggcc 



aacctgggct 
aacctctctg 
taaatgagcg 
aactggaaag 
gcaggtccag 
gtgttaaatt 
cccctggcct 
acctggcctc 
gaactccaag 
tgatgcccgc 
ggcctgcttg 
atcgccacct 
ggagggtgtc 
gtctgcatat 
ttaaacagat 
taaggttgtg 
tgaaataaca 
aactttaggc 
aagcagccac 
tgattgacaa 
gagtggggca 
cttttaggtc 
ttactctact 
agttgccttt 
ggctatgcta 
cggctttgaa 
tgtgtgtgat 
ccaagacaat 
gggcctgggt 
gatcacttga 
ctaaaaataa 
aggctgaggc 
cgccactgca 
aataaaaaag 
aggtttgcgt 
ttctcagaac 
ctgggctaac 
gagggtttga 
ggccgccagg 

ggggtcttat 

agtgccccct 
agcagcggca 
cctcaccagc 
tggggaagga 
ctcgcatccc 
agctctccca 
ccggaggcct 
gacttgctgt 
ctgcctggct 
ctcctgatca 
ttagtcggga 
ggctgctggt 
cctgaccata 
cccccatcca 
tccacagtac 
ctgccacaga 



ctcaaacaag 
gcctgttgtt 
aagcaagcaa 
cacaaaaggc 
tgtggccaga 
tccaaaattg 
aacaaataga 
tgaccctgaa 
ctggaggccg 
tgcaagctgg 
atcagggagt 
acaggcgcct 
ccctcagttt 
tatttacttg 
ttcaccagct 
tgagggcgtt 
gcttggggtt 
tcgtgggcca 
agccaaaggc 
acaggaagcc 
gtgagtgctt 
aaggatggat 
ttcctaggtt 
agtcttcagt 
tagggtaaac 
tgtggcccag 
tttttaaagc 
tcttcttctt 
gtggtggctt 
ggtcaggagt 
aaaaattagc 
aggagaatca 
ctccagtctg 
gttggccacc 
atgtgcactc 
atgtctctgt 
agtgtatcaa 
ggggcgcgca 
agactgggcc 
gcggggacct 
gcagcgggaa 
gcggccgctc 
agcagcctct 
gggcttcggg 
cgcccgctag 
acccccgaat 
ggatgcctgt 
gccttctaat 
gcctggctcc 
tatgcattgt 
gggagagggc 
ccactttttc 
gggtcaaata 
caccaccacc 
ccatcatatg 
acacctgcca 



ctctgccact 
ctagatcagt 
gctgttccca 
agggctaggc 
tcctctgatc 
aagacactgt 
caaggcctct 
gccagaacag 
cggtggccca 
ccgagctgga 
accaggaggt 
gctggagggc 
cccccaggca 
gaatgtgcat 
ttgctcccca 
ggtgctgtaa 
ctgggcaggc 
tgtagtctct 
aatacttgaa 
tgcatgtggc 
atagtaactt 
ggaatatagg 
tagctatgtt 
acagtcacat 
ttgtcaaacc 
cacaaatttg 
tcatcagcta 
ccagtgtggc 
acacctgtaa 
ttgagatcag 
cgggcaaggt 
cttaaaccca 
ggcgatagag 
cttgccacag 
catgatgttc 
tgtgaggcga 
ggacgcctga 
tagaaacaga 
ctgtcctccc 
gtgcgtgtcg 
cgtggcggtg 
cgtgcgcttc 
gcctccagag 
gcccagcctc 
tcccttcact 
ctaaaggact 
ggcttctgag 
ccaatgtctt 
ctccctctcc 
gccctgagag 
ctggatctcc 
cactctccac 
tacagacttc 
atcacccctg 
tgctgctgcc 
cactcacata 



agtttgccat 
gatcacacat 
aggcacttgg 
agatccaact 
ttttgaaaga 
gcaaacatgc 
ttttcttctg 
agatccttaa 
gtctgagcag 
gggtgccctg 
gatgaactcc 
gaggagcaga 
gtgctgcagt 
caacgctaat 
caactaattt 
atgtctgtaa 
aaactttttc 
gcctcaactg 
gaagcggtat 
cgtagtttgc 
aattagctac 
acagtagtcc 
cagatacaca 
gctgtacaga 
cctgaccaga 
taaacattct 
ccgttagtat 
ccagggaagc 
tcccagcact 
cctggccaac 
gatggacacc 
ggaggcagag 
agagactccg 
agcccaggtg 
gcacaatgac 
agcatggcta 
ggctaagact 
ggggagaagc 
cctctggcag 
ggctcccggc 
agcaccggcc 
gcctgagcgc 
gcagcccgga 
cccatcggcc 
gcttccgaga 
gtcctcgcag 
ggacagatgg 
ctaatacacg 
cctgtgcttg 
tctctccttc 
tgacaccttg 
cccagccccc 
ctcccagaag 
gggtcccaga 
cacatgggca 
gcacacagac 



6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8.04 0 

8100 

8160 

8220 

8280 

8340 

8400 

8.460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9721 



<211> 4721 
<212> DNA 



<213> Homo sapiens 
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<400> 366 
tgctagcagt 
gagagtcgcc 
aggtcctgtg 
tccgggccct 
gggggctttg 
gggcccagca 
aacctggaga 
aggtttactg 
ccatgtgcac 
tgcctgaggt 
acagacagac 
tgtggaggaa 
gattttatat 
gcactgcggt 
gtaacagctg 
aagctcagtt 
ttaggtttgt 
agttcaggca 
aggcagccct 
taggaatgtt 
aaagagtgct 
gcgggggtag 
gtctctacta 
actccagagg 
gaggttgcgc 
aaagtgctat 
gtgcaaactt 
aaagtaaaac 
ctgctataat 
aataatagaa 
gttttaatgc 
tcaactgagg 
cgatatctga 
atcacccact 
aaaagggtcc 
agcagagaag 
ccatggatca 
ttacgccttc 
cttctgagag 
cagtggtgtg 
cctcagcctc 
attttttttt 
acaaactcct 
gagtaggccg 
tcaaactcaa 



gagggaaagg 
atgacttgta 
cagccactca 
ggtctggtgg 
gcactggttc 
ctgcctgtaa 
ttgcacagtg 
ccttcctcca 
aaccaggact 
cccagtctga 
agaggcaccc 
agatgccaaa 
gcaattttag 
cccacaatac 
gattgtttgc 
actgtgattg 
aagggataat 
agctgattta 
gcttataggc 
ggaattagca 
ggccaggcgc 
atcacgaggt 
aaaatacaaa 
ctgaggcagg 
cactgtactc 
agtaccctgg 
gccttgcaat 
aggtcatgag 
acatcataga 
ctcatttatt 
attatgccaa 
acaatcaaac 
gaaagactgc 
cttcagcaaa 
attcagactc 
tttcttccca 
agacttctct 
ctcattcact 
taattttttt 
atcttggctc 
ccaagtagct 
tttttttttt 
cactgcaggt 
ccatgcccaa 
atctttacat 



ctcttgcagg 
ctggacaaca 
gacgacattg 
gatgtggcac 
gatagactta 
gcagggcagg 
tccctaaagg 
ggggagatga 
atcaaggagc 
caggtatggg 
gcctatgtgt 
gtccctggtt 
aggtgctccc 
cctatcttgt 
tgagcagcag 
caccatgcag 
gaaggctgcc 
agagctgggc 



ctatttgttc 
agatccatgt 
gccaggacct 
tcacctgcag 
ctgcctgcta 
ttttgcctgg 
actgccatgt 
aagggatatc 
aggccacggt 
tgacccacat 
gcatctgtgc 
gccctacgtg 
gtggtcagtc 
agctgaatgg 
tctcttctgg 
ggtgaggtgg 
aaggccaggc 
tcttcttgcc 
ccagagctca 



tcagagtcct 
gattgtcttg 
gagatatggt 
ctgctgcatc 
ctgtggacac 
caccattgtg 
cataaagcag 
caaggtgggt 
gggcactgag 
tgggagaggc 
aaggcccaga 
tttcaaaatt 
aattgggcag 
ttatggacag 
agctggtgtt 
gctgagactc 
tggaaggaca 
ctgtcagtcc 
tattgcaagg 
gtttctggcc 
ggtggctcac 
caggagtttg 
aattagcttg 
agaattgctt 
cagcctgggt 
cacataagtc 
atactcatta 
aacaaatgag 
ttgatccctt 
ttctactccc 
agtatatttt 
ccttcacaat 
cattgagccc 
acttcagggc 
ttagaactgc 
gaaacagaca 
gccgtcatgc 
tggcaggata 
ttctttttga 
actgcaaacc 
gggattactg 
ttggtagaga 
gatctgcctg 
ccttctgaga 
tacctacaaa 
aaattafcago 
tctccaatcc 
gaacatggac 
ctgggccaag 
gatgtggcag 
ggattctgga 
gtggttgcac 
ccacactgag 
gatctggcac 
gatccagagg 
ccaaggccag 
gatctcagca 
agggagtgcc 
cagactggac 
tcccccagaa 
cccttagcga 
agtaccagga 
aacttctaga 
tctgcataga 



cttcttcctc 
cctaggctcc 
tatgacccca 
acggcaagcc 
agattcagct 
ttgatcaatg 
gaggaggaga 
gtcctggatc 
gaaagaatca 
acacagacag 
aatactctga 
tcaatgaatt 
cccttagaac 
aaaacagaaa 
tgtgaacatt 
agctatttgt 
gccgagaaag 
tgccgggcta 
ttttataggg 
aaggatctga 
gcctgtaatc 
agaccagcct 
gcgtggtggc 
gaatctggga 
gacaaagcaa 
aatttgcaag 
gtcttggttc 
tctggaagtc 
atgaaatagc 
ctaaactctg 
caccaggtaa 
ctagaatgca 
cattggaggg 
cttggacctt 
acatccattg 
gtattctaga 
aactcttacc 
atgctgtaat 
gacggagtct 
tctgtctctt 
gcatgcacca 
tgggatttca 
cctcagcctc 
gtaacttgac 
gtccacctat 
tatgttctca 
cacatctcag 
tgcatcattg 
gctgagtctt 
ttgggagggc 
tagtaggcca 
aggctgagca 
cctcaggagc 
ggggagactg 
ctgatggcca 
agagacttat 
ttcattctcc 



tggcttcatg 
cgagcggagt 
ccctcagctg 
cctgctacca 
accactctgg 
agagcctcct 
agatcaagtg 
acacccttcc 



aggatgatga 
atattcaggt 
ctccaagctg 
tgaccgctgc 
ggtgatgaac 
tggtgaggag 
cctgccccac 



gaggcaggca 
acagacagac 
cagttcagat 
tagtttaaag 
tgatcagagg 
gtgacacatg 
tggctgcctg 
tacaaaggaa 
gaataaggcc 
cctcctgaca 
catgtattcc 
ggtacagtta 
tcagcacttt 
gaccaacatg 
acgcacctgt 
ggcagaggtt 
gattctgtct 
aaaatctgtt 
ttgaatcaca 
taattccgta 
tactggaagg 
acataactaa 
aggaagcttt 
gagatcaggt 
ggccatatca 
cggtctacat 
gagaccttaa 
tgtggacagc 
tctcttaatt 
tcgaatttca 
ctattgccca 
gggttcaagc 
acacactggc 
ccatgttggc 
ccaaagtgct 
ggagtgttgt 
tggccaactt 
acttaagtca 
acacctctgt 
ccgagatcaa 
ggtaccatgg 
aggatgtgag 
atagctctca 
ctgcacaacc 
atctctgctt 
tgggctgcac 
aggtggagaa 
ggcctaacct 
agtccctttg 
aggcaagagg 
aatggacaga 
gaggtggagg 
aagctggtcg 
tccaagctgg 
cagaggtggg 
tttttagcac 



tcctccagca 
ctgcaaatcc 
catcttgccc 
tggcctcact 
cagcacatgc 
catgcccctc 
cctcaacagc 
tgaaccccca 
agaccctgcg 
agacagacag 
acggagagct 
atcttagttg 
aactctggct 
gaaaacagaa 
tgattgactg 
aattctgaag 
aatagaccca 
aaagcagagg 
taattccaac 



ttataggtta 

gggaggctga 

gtgatacccc 

aatcccagct 

gcagtgagct 

caggaaaaaa 

tgggattttg 

tgactgggtt 

atagggacca 

cctatgccct 

atgctgcaag 

tcaagatcta 

catctggaga 

agttcttctc 

ctcgcaactc 

gataaagcca 

tttcccaaga 

ttttccttgc 

taatcagtag 

ggttggggtg 

aattctcatg 

taatttttgt 

catgctggtc 

aagattacag 

atctgtcatg 

cagcaacatc 

ctgcccccac 

ggttgtcaag 

ggcacagtat 

ctgcctgcca 

atacctatta 

ggttggggtg 

tgcacaatca 

cccccatttc 

caaggaggag 

tgccaagtgc 

ttgtcacaca 

tccatgtaga 

cctgtttctg 

gagaccgtaa 

tgacccagtc 

agctggaggg 

gcctcaatgt 

tgcagcctgc 

catcacttac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200^ 
1260^ 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
258fl 
264^ 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
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ccagccctca 
tgagcacagt 
cttgcatgat 
atgccctaat 
ttcaggctgt 
tgagctggtg 
cacactgaaa 
tagggagact 
gttcttttgt 
ctggcacttg 
acccaggtct 
agttcagagc 
taaagctgac 
tgtgagtaat 
gctctgaggg 
<210> 367 



ctactacttc 
ccaaggagcg 
ctattaggga 
ttataacact 
tgctgtgact 
ggagctggtt 
gacaaatagg 
gtgggggttg 
gtcctgctgc 
tgttgttttg 
tccctcctct 
tggtgaccac 
cttttctctc 
tctggccctg 
tggaaatacc 



tgttttcatc 
gagtcttagt 
caatggtttg 
taccgatttc 
tcatgtagaa 
caagaacaac 
aagatatttg 
999agtccgc 
attttcacag 
ggacagggtt 
cactttctgg 
atggtctctg 
ctgcaccagg 
aagggagact 
aaggaagtgg 



ttggaaccct 
tgagtccaca 
tctcagagaa 
agagttgtat 
ttggctagaa 
attggttggg 
tctcttctct 
ctggtgtgga 
gctccacttc 
gggacgatgc 
gaagggcttg 
gtgtcattgt 
ctgtgtgatg 
ctgaggtttg 
ttccattcat 



tttaggacct 
ggccgatttc 
cttgttctct 
atacttctgt 
atgtgcacaa 
gctaggatgg 
tagatgaaaa 
gttattggga 
ctggggggat 
tgggcctttg 
ggtgacctct 
caggaattaa 
gcatcagtgc 
ccagtgggac 
t 



ctactggtat 
caataggctc 
gagataaaca 
catggctgat 
taggctcttg 
aatggtgtct 
acctgaggct 
cagttggaga 
ctgggagagc 
agggtgctag 
tggttgactg 
tgcccagtgc 
tgtgaatatc 
gaatttttta 



3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4721 



<211> 1925 

<212> DNA 

<213> Homo sapiens 



<400> 367 

actccaggtc 

gatcaggatt 

gctgctgcat 

gcttcggcag 

gctaccgctc 

acgagagcct 

aggaggagaa 

tgcgcttcct 

gcgagtgttg 

gggaggccga 

aggaggtgct 

ctgagaacga 

acctggaggc 

aggaggagat 

acaacagccg 

acattgtcac 

tgaaggccac 

agctgaaccg 

ccaagctgga 

cccgctgcaa 

gcctgatcag 

ccacctacag 

tgaatgtctg 

gctcccggcc 

gcaccggcct 

gcgtgggctc 

aatgttaggc 

ttggcccagt 

acccctccca 

ccccacagtg 

caggagcagg 

ccttcctcct 

aaaaa 

<210> 368 



ccctatcctg 
tggtgggcgc 
caccgccgcc 
ccacagcgtg 
cgggggcgtg 
cctcacgccc 
ggagcagatc 
ggagcagcag 
ccagagcaac 
gtgcgtggag 
ggagggctac 
gtttgtggct 
caacgtggag 
ccgcattctc 
ggacctgaac 
ccgcagccgg 
ggtgatcagg 
catgatccaa 
ggccgcggtg 
gctggccgag 
ggagtaccag 
gcgcctgctg 
tgtcagcagc 
agtgactggc 
gtgtgcgccc 
ctgtggtatc 
accccaactc 
cagtcaggcg 
ctcctggcct 
ggaaaggcca 
aagggccagg 
ctgctcttga 



tcctctgcaa 

gccttcagct 

ccctaccgtg 

tgcggaggct 

tgcgggccca 

ctcaacctgg 

aagtccctca 

aacaaactgc 

ctggagcccc 

gccgacagcg 

aagaagaagt 

ctgaagaagg 

gccctgatcc 

cagtcgcaca 

atggactgca 

gccgaggccg 

cacggggaga 

aggctgacgg 

gctcagtctg 

ctggagggcg 

gaggtgatga 

gagggcgagg 

tcccggggcg 

agtgtctgca 

tgcggccaat 

agctccctgg 

aagtcccagg 

cctggagaag 

cacatttctc 

ccctagaaag 

accagaacct 

tctgtgtttc 



cccaaacgtc 

gcatctcggc 

gcatctcctg 

ttcgggccgg 

gtcccccatg 

agatcgaccc 

acagcaggtt 

tggagacaaa 

tgtttgaggg 

99aggctggc 

atgaggagga 

atgtggactg 

aggagatcga 

tctcagacac 

tcattgccga 

agtcctggta 

ccctgcgccg 

ccgaggtgga 

agcagcaggg 

ccctgcagaa 

actccaagct 

agcagaggct 

gggtcgtgtg 

gcgctccgtg 

tgaacaccac 

gtgtggggtc 

ccccaggcat 

tgctcagcta 

tgtgtgatcc 

aagtccgctg 

tgcccacggc 

aataaattaa 



caggaggatc 
ctgcgggccg 
ctaccgcggc 
ctcctgcgga 
catcaccacc 
caacgcgcag 
cgcggccttc 
gctgcagttc 
ctacatcgag 
ctcagagctt 
ggtttctctg 
cgcctacctc 
cttcctgagg 
ctccgtggtt 
gattaaggca 
ccgcagcaag 
caccaaggag 
gaatgccaag 
tgaggcagcc 
ggccaagcag 
gggcctggac 
atgtgaaggc 

cggggacctc 

caacgggaac 
ctgcggaggg 
ttgcggcagc 
ctttgcctgc 
cttctcctgc 
cccacttctg 
gcacccatag 
aactgccttc 
tgtagccaaa 



atgacctgcg 

cgccccggcc 

ctcaccgggg 

cgcagcttcg 

gtgtcggtca 

tgcgtgaagc 

atcgacaagg 

taccagaacc 

actctgcggc 

aaccacgtgc 

agagcaacag 

cgcaagtcag 

cggctgtatg 

gtcaagctgg 

cagtatgacg 

tgtgaggaga 

gagatcaatg 

tgccagaact 

ctcagtgatg 

gacatggcct 

atcgagatcg 

attggggctg 

tgcgtgtcag 

gtggcggtga 

ggttcctgcg 

agctgccgga 

cctgccttgc 

actttgaaag 

ggctctgcca 

gaaggggcct 

ctgcctctcc 

aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
! 600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1925 



<211> 2108 
<212> DNA 



<213> 



Homo sapiens 



BHC 03 1 051 
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<400> 368 

tccattcgga 

tggatcttac 

ccgctgctgc 

gggcttcggc 

cggctaccgc 

caacgagagc 

gcaggaggag 

ggtgcgcttc 

ccgcgagtgt 

gcgggaggcc 

gcaggaggtg 

agctgagaac 

agacctggag 

tgaggaggag 

ggacaacagc 

tgacattgtc 

gatgaaggcc 

cgagctgaac 

ttccaagctg 

tgcccgctgc 

ctgcctgatc 

cgccacctac 

ggtgaatgtc 

cactactgcc 

ggtgggcact 

taagaggtgc 

gccagtacct 

gctgcctgca 

agtccgggga 

ttgttgtccc 

gcattggtct 

gtccctgtct 

ggaccctgga 

atctccttcc 

ctttgtcttt 

aaaaaaaa 

<210> 369 



cgtctccatc 
tgtggtggcc 
atcaccgccg 
agccacagcg 
tccgggggcg 
ctcctcacgc 
aaggagcaga 
ctggagcagc 
tgccagagca 
gagtgcgtgg 
ctggagggct 
gagtttgtgg 
gccaatgtgg 
atccgcgttc 
cgggacctga 
acccgtagcc 
acggtgatca 
cgcatgatcc 
gaggctgcgg 
aagttggccg 
agggagtacc 
aggcgcctgc 
tgcgtcagca 
cctgttgtct 
actaacgcct 
taaggaggct 
cgcgccagga 
ctctaagcgc 
gggccgggag 
gaggattcat 
tgcctgagct 
gcacgacctg 
tagtggttct 

ggggctgccc 

gtttcactct 



ctcagaacct 

gcgccttcag 

ccccctaccg 

tgtgcggagg 

tgtgcgggcc 

ccctcaacct 

tcaagtccct 

agaacaaact 

acctggagcc 

aggccgacag 

acaagaagaa 

ctctgaagaa 

aggccctgat 

tccagtccca 

acatggactg 

gggctgaggc 

ggcacgggga 

agaggctgac 

tggctcagtc 

agctggaggg 

aggaggtgat 

tggagggcga 

gctcccgcgg 

ccaccagagt 

gcgccccctc 

gccgcctccg 

gaacgcgccg 

tctccccacg 

gcgccatggt 

ctttttcttc 

cttcctcaaa 

ggactctgcc 

atgactctgc 

tcaagagctt 

gtgtttccaa 



cctctcttcc 

ctgcatctcg 

tggcatctcc 

ctttcgggcc 

cagtccccca 

ggagatcgac 

caacagcagg 

gctggagaca 

cctgtttgag 

cgggaggctg 

gtatgaggag 

ggatgtggac 

ccaggagatc 

catctcagac 

catcattgcc 

cgagtcctgg 

gaccctgcgc 

ggctgaggtg 

tgagcagcag 

tgccctgcag 

gaactccaag 

ggagcagagg 

tggcgttgtc 

cagtagcgtc 

cgcccgggtt 

ccagcgcctg 

cccgcgccgg 

tccgctccgg 

ctctctctgt 

ctcctacctt 

gcttggagga 

catgtgcttt 

gagggacagg 

ctgaaaaact 

taaactcatt 



ccaaaaagca 

gcctgcgggc 

tgctaccgcg 

ggctcctgcg 

tgcatcacca 

cccaacgcgc 

ttcgcggcct 

aagctgcagt 

ggctacatcg 

gcctcagagc 

gaggtttctc 

tgcgcctacc 

gacttcctga 

acctccgtgg 

gagatcaagg 

taccgcagca 

cgcaccaagg 

gagaatgcca 

ggtgaggcgg 

aaggccaagc 

ctgggcctgg 

ctgtgcgaag 

tgtggcgatc 

cccagcaaca 

ggcgtctgcg 

tcgccgtcac 

cctcccaata 

gagccatccc 

agcctttcct 

ctgttttttt 

acgggggagg 

tgcctgtgga 

cccacgcgtg 

aatgactctg 

gtagcgaatc 



ccatgacttg 

cgcgccccgg 

gcctcaccgg 

gacgcagctt 

ccgtgtcggt 

agtgcgtgaa 

tcatcgacaa 

tctaccagaa 

agactctgcg 

ttaaccacgt 

tgagagcaac 

tccgcaaatc 

ggcggctgta 

ttgtcaagct 

cacagtacga 

agtgtgagga 

aggagatcaa 

agtgccagaa 

ccctcagcga 

aggacatggc 

acatcgagat 

gcgtcggctc 

tctgcgcctc 

gcaacgtggt 

gcggcagctg 

tctccaccca 

gccgccgccc 

cggtcgcagg 

ggtagtcaat 

tttcatgtat 

gcccgggaat 

atggagacgc 

tggggagaac 

ctgccttctc 

aaaaaaaaaa 



60 
12 0 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
102 0 
1080 
1140| 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2108 



<211> 1883 

<212> DNA 

<213> Homo sapiens 



<400> 369 
gttccatcct 
tggcttcaac 
cgggccccgg 
ccgcggcctc 
ctgcggacgc 
caccaccgtg 
cgcgcagtgc 
ggccttcatc 
gcagttctac 
catcgagact 
agagctcaac 
agcacttcga 
ctacctccgc 
cctgaggcgg 
cgtggttgtc 



ctgccatcta 
tccataggct 
ccaagccgct 
accgggggct 
agcttcggct 
tcggtcaacg 
gtgaagcagg 
gacaaggtgc 
caaaactgcg 
ctgcggcggg 
cacgtgcagg 
gccacagcag 
aagtcagacc 
ctgtacgagg 
aagctggaca 



ctccactgtt 
gtgggttccg 
gctgcatcac 
ttggcagcca 
accgctccgg 
agagcctcct 
aggagaagga 
gcttcctgga 
agtgctgcca 
aggccgagtg 
aggtgctgga 
agaacgagtt 
tggaggccaa 
aggagatccg 
acagccggga 



cagacacctc 
ccctggaaac 
cgccgccccc 
cagcgtgtgc 
gggcgtgtgc 
cacgcccctc 
gcagatcaag 
gcagcagaac 
gagtaacctg 
cgtggaggct 
gggctacaag 
tgtggctcta 
cgtggaggcc 
cattctccaa 
cctgaacatg 



ctaacctccg 
ttcagctgtg 
taccgcggca 
gggggcttcc 
ggacccagcc 
aacctggaga 
tccctcaaca 
aagctgctgg 
gagcccctgt 
gacagtggga 
aagaagtatg 
aagaaggatg 
ctgatccagg 
tcccacatct 
gactgcatcg 



tcatgacctg 
tctctgcctg 
tctcctgcta 
gcgccggctc 
ccccatgcat 
tagaccccaa 
gcagattcgc 
agacaaagct 
ttgctggcta 
ggctggcctc 
aagaagaagt 
tggactgtgc 
agattgactt 
cagacacctc 
ttgccgagat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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caaggcacag 
cagcaagtgt 
caaggaggag 
tgccaagtgc 
ggcggccctc 
caagcaagac 
cctggatatc 
tgaaggtgtt 
ggacctctgc 
cgggaacctg 
tggagggggc 
cccttctgcc 
tggcttttcc 
cccacctttc 
tgtctcagcc 
gagactgttc 
tgcaaaaaaa 
<210> 370 



tatgatgaca 
gaggagatga 
atcaacgagc 
cagaactcca 
agtgatgccc 
atggcctgcc 
gagatcgcca 
gaagctgtga 
gtgtcgggct 
gtggtgagca 
tcctgcggcc 
tgccagacgt 
ccttgtgttc 
atcccactgg 
ttcagcccct 
cccctgcctc 
aaaaaaaaaa 



ttgccacccg 
aggccacagt 
tgaaccgcat 
agctggaagc 
gctgcaagct 
tgatcaggga 
cctacaggcg 
atgtctgtgt 
cccggccggt 
ctggtttgtg 
aggggaggta 
gccactgccc 
tgagaataca 
aaaggggtct 
cccggggaga 
tctggcctct 
aaa 



- 189- 

tagccgggct 
gatcaggcac 
gatccagagg 
tgcggtggcc 
ggccgagctg 
gtaccaggag 
cctgctggag 
cagcagctcc 
gacgggcagc 
caagccctgt 
ttaagtggcc 
caccaccagc 
ccatcggctc 
gtggctgggg 
agggccttgc 
tgtctcccct 



gaggccgagt 
ggggagaccc 
ctgacagccg 
cagtctgagc 

gagggcgccc 

gtgatgaact 
ggcgaggagc 
cggggtgggg 
gtctgcagtg 
ggccagctga 
caaaagagag 
tgaaaacagc 
attcccacca 
aatagaccca 
ttccctggaa 
tttccaataa 



cctggtatcg 
tgcgccgcac 
aggtggagaa 
agcagggtga 
tgcagaaggc 
ccaagctagg 
agaggctgtg 
ttgtgtgcgg 
ccccctgcaa 
acaccacctg 
ccaggggagc 
agcacatcgc 
gtggctcctc 
ttccttcccc 
gaagcactgt 
acttgggacc 



<21l> 13464 

<212> DNA 

<213> Homo sapiens 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1883 



<400> 370 

gcaggttgca 

ttaataatgt 

ggagcagtaa 

ctgtcccgca 

tcaagaactt 

gtgccatggc 

ttggcagtca 

ccctacacag 

cccgcatcac 

accccaatgc 

ttgctgcctt 

ggggaaaggg 

acgtggctgt 

caaatgcaaa 

caggcaagca 

agtaggacag 

ggagatgcta 

tgcctgctcc 

cgaggcactt 

aagcatggtc 

tcctggttga 

acagtctttg 

ctgaatggtg 

gtgggtggaa 

gaggtgctgg 

tcagcacaaa 

caagagcagg 

agggggctgg 

catgctcaaa 

ccgttgagag 

gcttttccct 

ggctccaaac 

gccagagggc 

gataatcaga 

gtaccaggct 

ggtaaatgca 

gcatgcccac 

tgccagaact 



gtttcccttg 

tctctgcctt 

caggcctccc 

gaacagaggc 

cagctcccgc 

ccatcatggg 

gagcctgtgt 

caggggcagg 

atctgttacc 

ccagtgtgtg 

cattgacaag 

acttggaacc 

gagaacgtgg 

atgaatctga 

gcaagcatga 

tgtggcagag 

ccgtccctag 

tccgtggtgg 

tataccctgc 

cagggagagg 

gtgtatagct 

ggggaggaca 

tgtatatcta 
gttctgctct 
gtgtccattg 
gaagctcact 
ggaggttgaa 
aggcatcttt 
ggggtctctg 
gaaccagttc 
cccttggggt 
tgcattgagc 
tgtcccatga 
gtttttgttc 
tgtactccca 
aacttcaacg 
ctagagggac 
tctggtatgt 



cgttctgcct 
tatgttgtca 
atgtgagacc 
atggcgagcc 
tctgccactg 
gtcagccctg 
acggtggggt 
tttggctact 
atcaacgaga 
aagcatgagg 
gtgggtctgt 
cgatgatagg 
ctgaatcagg 
aaccaccaga 
tgtgtgtgca 
cgggccactc 
catggagcag 
ggacatcata 
agcacatgca 
ctgttgaatt 
ctggtttcct 
gccaccaaag 
caccctggga 
cagatcaacg 
ttaacatacc 
gtagagttag 
atccagccca 
tcctaatttg 
catgtggcct 
tgggtctggg 
tgctgatggc 
tccagactac 
cctgggtctg 
cttgtccctc 
agaggttagg 
ggggcttggt 
agtagctgct 
caagagggtc 



agcgggagat 
ggaggcctgt 
gtgtcatcct 
aatcctgcca 
tgcccaagcc 
gggggctagg 
ctccccggat 
gggcaggggg 
gcctcctcat 
agaaggagca 
gagctgtttc 

gtggttctaa 

gtcttggttt 

cagaagaggg 

ccgtccatct 

tgcctaggcc 

gagcaggatg 

ggaaggctgt 

tagattaaag 

gacttttctg 

ttaacctaat 

tttaaaggag 

ctgcataaga 

ctacatcaga 

gctggctgtc 

gcaggttatt 

ggctttgctt 

ctcaaaagca 

tttgggcatc 

gagaggaact 

caccctcttc 

aaagtgcatt 

catgttgaaa 

aagcacggac 

gttggagaag 

gggtcacatg 

gctcactcca 

agagagatcc 



tcctatgctc 

cccctgccca 

tcaatcctca 

catcagctct 

tgggtatcac 

ctccaggcgc 

cgcggtgagt 

cctttgcagg 

gcccctcaac 

catcaggtgt 

ctgggactgt 

ttctcctgtt 

tcaaatgttg 

gtgtttgctg 

tcccgatctc 

tgtctgacct 

cagaggggtc 

gaacccttag 

gcagtgagtt 

atgctgcagt 

gatgccctcc 

gagaagctcg 

aggcacaaac 

agtgggcata 

cagattactc 

cattgcacaa 

gcagagccat 

cagtgagaac 

acccatgaat 

tataccactg 

cctccagcct 

gatatggtgg 

tccaaggcag 

aattggacag 

aaatcacctg 

aatgaagcca 

gccacaggcc 

agattcttac 



ataaatggca 

ataaaaggca 

gtccagggga 

ggctgcgggg 

agctgtgtca 

ctcggaggct 

tgtagatggc 

cccagcccac 

ctagagatcg 

ctcaacaagt 

ggcttaggaa 

actgactagc 

ccatgtcagt 

ctgctgctca 

accggcacgc 

cccgcttaga 

agaggctcct 

gacttctgcc 

gagaaactta 

ttcagagtcc 

ctaggagctc 

tcttccaggg 

ca gggcgatg 

tctgcacaca 
ttcccttcca 
ggacacctag 
gtgccctggc 
cctgagagtg 
gctgctctct 
caccaggctg 
tcttcctgca 
gtttaatcct 
atgccttttg 
ttgagggttt 
ctttagggct 
aacttgagag 
tacctagcct 
ttgaaaccac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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190- 



tcagttatta 
gacatgcctg 
tcctgtcttg 
agagttagag 
gccccagcac 
aaatgcccac 
taaggtgtcg 
gacatttagg 
tcttgctggc 
cttgtggccc 
tttcaaatcc 
tatgaaatat 
atagtgtgct 
gacctgcact 
cagggcgggg 
tcaggctttc 
gcagaacaaa 
caacctggag 
ggaggccgac 
ctacaagaag 
gctgagatca 
agctatcaga 
acccctgtgc 
gacacacatt 
aggagcaagg 
ggaaagggaa 
tctgatgaac 
tgcagggctc 
gttcattacc 
gaagggtgat 
agttcaggtt 
gcaaatgtag 
gtgcgggctg 
aaaagttatg 
ggccgaggtg 
aaaccgcgtc 
taatcccagc 
tgcagtgagc 
ttcaaaaaac 
gtcttctttg 
gctgggctgc 
aagtccttgg 
tgtgttttgt 
ttccacaaat 
gagattaaat 
agggacctgg 
agtccagaga 
Cggcacsccfc 
tgtttctcct 
tgatgctgaa 
acgctctgat 
ctggagcccc 
tgcaggaatc 
gcttgacttt 
tttttgcggg 
atgatttcaa 
ataaaactgt 
ttctttaaca 
agtgaaggct 
cagctatttg 
tctgtgactg 
cgttctgggc 
ttccccaagt 
ctgtcccttc 
gaagggtggc 
ggcccgctga 



agtgttggca 
tgggcctgat 
ttttacaatg 
tgagaggtct 
cagaggtaag 
aaacttagca 
gcagggctgc 
cttttggcag 
tggctgccag 
cctcttcctc 
ctctagtgca 
ttctgcagga 
ctttctggga 
ttcagacttt 
gccatcctca 
ccctcagtct 
ctgctggaga 
cccctgttgc 
agtggcaggc 
aagtgagcat 
cacaactcct 
cacttctgct 
tgggggctgg 
cagggtacag 
acagcagggc 
aggccctgga 
ttggattctt 
cacaattcaa 
tagtctaaac 
tttgacctca 
ttaccatctg 
acacaatcct 
gtaattatag 
aaaacaggcc 
ggtggatcac 
tctactaata 
tactcaggag 
cgagatcacg 
aaaataataa 
tgggggcagg 
tggaggaggg 
atattggggc 
acagtgtaga 
atttattgag 
gtcttgcttc 
acttcaagtc 
ggcaaaacct 



actaatcaga 
agggcctgga 
caattcagtg 
cctaattcct 
gaagagtcat 
acttaaaaca 
atttcttttt 
aattgaggtc 
aggttgttct 
catcttcaaa 
ggcacctttg 
tgttaatttc 
aggggggtgg 
gccaataatt 
atggcagaaa 
ctttcctcct 
ccaagctgca 
agggctacat 
tggccttaga 
gacagggctg 
ccagaagctg 
ttaccgtctg 
gaaaagattt 
actgatgcag 
tggagaggag 
gaggtatctg 
cctttattta 
ataccaatgt 
tcaccgggtc 
atcaatcctg 
gctggaccca 
cagtgtgcag 
agctgagtgc 
gggcgcagtg 
ctgaggtcag 
atacaaaaaa 



tctctataaa 
gctgccagtg 
agaatgaggc 
ggtctgggat 
ggtatgtaca 
acacaaattt 



gggc i.c *-ggcs 
caggtatgaa 
gaaggtgagt 
gctgtgggcc 
tggcacaaga 
cagacagtgc 
ttttaaacta 
cattgcctcc 
agtgtgacca 
gaatggtttc 
agtaatgtgc 
ggtgacagat 
agtgctggtg 
tcgcatgctt 
cgtggtatga 
ggtctcattt 

tagggggtcc 

tgggcagggt 
ggagagccga 



gctgaggcag 
ccactgcact 
taaagttatg 
gctgctcctg 
tgggggaatg 
agggggaata 
atttaaaggc 
tgactcactg 
aggttttgca 
cagtgtcctt 
cttctccaaa 

gaagagctgg 
aactgttctc 
ctgtcaggtt 
aggtgggtta 
cgatagagcc 
ctggattggc 
ccctgttatc 
tcattcagtg 
ccttttgttt 
ttgcttgcca 
ctgccaatag 
gtttcctggc 
gtactctgtg 
ggcttccttc 
atgcaatggg 
ctgaaggtga 
caggccagag 
ggctcagggg 



ggaggcccca 
cttgtggttg 
cagcttccag 
gccataatag 

gggggatcat 

attctgtgta 
gtggagagaa 
tggcctcctg 
gagctctggg 
tcctgaatgg 
gttctaccag 
ggcgactctg 
gctcaactgc 
ggatggttct 
agactgctga 
ctttatctcg 
agagaagagt 
agatgaagag 
gagcccggag 
aaaatgaatt 
ttctgcagat 
aaatttgggt 
actgtactta 
gtaattccaa 
gcagtgggtc 
gcgtcaaggt 
ttactgtagc 
gctcatgcct 
gagttcaagg 
aatttagctg 
gagaatcgct 
ccagcctggg 
aaaaccatgc 
ggctacaatc 
gcttgaagaa 
aaacctcatg 
tctgtggaat 
cacttcaggt 
gcaacttcag 
ccaatgggtt 
ggcacagaga 
cctcagtc cc 
ccctcagggc 
acatgggaag 
cccattagcc 
gtttggttgc 
ttcataaaaa 
tgaaatgttc 
caagactagg 
caattggtga 
catgttgctg 
gcctccttat 
aggcttggcc 
ctgggctctg 
ggtcagtctg 
tttggcaagg 
aggtaacgtc 
agatcagaag 
gcagagaggt 
cacagcgagg 



acagtgtgca 
taccactgct 
aaagtggtct 
actctgtcct 
tttaaaaatt 
actacctaaa 

ggggagcatc 

caggactgag 
aggctatcca 
cgggttgagt 
tatttgccta 
gaataaacga 
gttacttggg 
gggaaggtca 
ggaccaggca 
caggtgcgct 
aaccgcgagt 
cggcgggagg 
acgcaggagg 
aggcagtgga 
aggccagggt 
tgtcagctcc 
tccctgtctg 
ataggaacaa 
cagtcagagc 
ggctgaagcc 
aaaccatcag 
tttcctgagt 
ccttgtagat 
ccctgcttgg 
ctcatgtccc 
ctgtctccca 
cccaaatgac 
gtaatcccag 
ccagcctggc 
ggtgtggtgg 
tgaacccggg 
caacaagagc 
tctgggctct 
tcacaggcag 
ggggttgctt 
ggggtggctt 
atctgtctgt 
gatttagacc 
ttaagctggg 
ctttgtggct 
aaggagaaaa 
aggggcct-tt 
cacagctgag 
aattaaatcc 
caattgttgg 
agaagccatc 
gaaaacctca 
aattttaatt 
ggttggttct 
tgatggaaaa 
agtgaaagtg 
ttgtgagtat 
ttggtctcag 
aggacaacct 
gctgggaggc 
tcctcggctc 
actggcagac 
tctaggttag 
gggccaggag 
gggacaaagg 



aggcaagtaa 2340 

aatgtatttc 2400 

ggt tggtggc 2460 

gggcttcagt 2520 

gctgt tgtac 2580 

aggctgatgt 264 0 

tgtttccagg 2700 

gtccctgttc 2760 

cattcc ttgg 2820 

ccttctcatg 2880 

gtatgcatag 2940 

acgtttataa 3000 

gactacttgt 3060 

caggctgctg 3120 

aaacctgct c 3180 

t c c t ggagca 3240 

gc tgcgagag 3300 

ctgagtgcgt 3360 

cgc tggaggg 342 0 

gtcagtctta 3480 

tgccctgaca 354 0 

tagttcttac 360i k 

ggagggatgg 3 66 m 

atgaaacagg 372 0 

taggacgggg 3780 

att ccaagta 3840 

gttatcgttc 3900 

tatttagctt 3960 

cagacagagt 402 0 

gattcttgga 4080 

t tggggtgaa 414 0 

ctctgggggt 4200 

catggaattc 4260 

cactctggga 4320 

caacatggtg 43 80 

tgggcgcctg 4440 

aggtggaggt 4500 

gaaactccgt 4560 

gggaactctg 4620 

gaggaagagg 4680 

ctaggaattc 4740 

catagagctt 4800 

ttttaaatag 4860 

aagacacaaa 492 

accagatacc 4 9 

gggaaataga 504 0 

ccagggagtt 5100 

ccacctgggt 5160 

aatgagttca 5220 

tggacaccgg 52 80 

tggtaaggta 5340 

atagagccca 5400 

ccatgctcct 5460 

ccgtcttggt 5520 

cagcttgtta 5580 

gccctgggga 5640 

catgtgaatg 5700 

cacagggtgg 5760 

gcaggagagt 5820 

acagagaggc 5880 

agcagcc tga 5940 

tccacttcat 6000 

aacaaaggc t 6060 

acttgtaagt 6120 

gtggcctcag 6180 

tcactccaca 6240 
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tcctcttgtg 
gaggccaacg 
taaatctctc 
gagtgtggcc 
gcctggatat 
gtgatgccct 
gagatctgac 
cctctccctc 
gattattcag 
tgttgatccc 
ttccagcctt 
ctaatttcag 
aggggatttg 
tcacaaatct 
gacttggttg 
gaggctgaga 
aggaggatag 
ctgccgctga 
ctggctgatt 
cctgcagtgc 
caaggatgag 
cgacaagcag 
tggttcctgc 
actgtgccag 
ttatcaaact 
ttaattttca 
tatttgtatt 
ttccttcctt 
gcctttgtca 
cctttattga 
taatggttta 
cgttagtttt 
ttttttttgt 
atagattatg 
gtgctcccac 
tcactgcaca 
gcgagtggtt 
agtgcttttg 
agttagtgtg 
gccaaagagt 
gcatagctta 
aacaataatg 
tctatgaaag 
taatgatcat 
tcaagtcagt 
tccctgccta 
attaaccgtt 
tccccacaac 
agctggaggc 
actgcaagct 
tgctcaagca 
cttatcgcaa 
tcaggttcct 
agactctaga 
cagagagggg 
cagtctctgg 
ctggatctca 
agcttcagtg 
tgggatgtgc 
aatgagcggc 
cagagcctgg 
ctgaggagaa 
gggcagagtg 
ccacatcgtt 
gagtatacag 
cacttattag 



tctatctcca 
tgagccgctg 
cccatctctg 
tgtcctggtt 
ttttccttat 
tcctttaaga 
aattaattta 
tggccacaat 
gccccaaggc 
tgtattagta 
ttcctcttag 
agttgggata 
gagttgaggg 
ctgacacctc 
tggcttagat 
cctgatacca 
ctgtgagcct 
ggtcctgggt 
gggtcgggga 
gaggaggtga 
ctcaacgagc 
caggttgggg 
ttccctccta 
agtacactac 
tccaaattct 
tttctttggt 
ccccttctaa 
ttttcttagt 
gtacacatat 
gcccaaatct 
gacttagaag 
atagttttac 
atacacagtg 
taatccatcc 
acatgcctgg 
ggatgtatca 
tactttttag 
ttcatttgct 
tccggagttg 
gagaccctgc 
gggcaagtca 
cctgtagtga 
taccttgtaa 
tcctatttca 
tttatgataa 
gttaattttc 
ttgaatggag 
ctgcaccctt 
tgcggtggcc 
ggctgagctg 
gtaccaggag 
actgctggag 
actcagagct 
aatcatggac 
aaggggcttg 
gctggccacc 
ttccaaggcc 
ggttcttaaa 
aaagaatcct 
acaagttctg 
ggttagtggc 
gctatgctga 
atgcgtattt 
agagtttcta 
gtgaagagca 
catgtgacct 



ccaggatgta 
aaggaggagt 
gaagagcaag 
CCtgggttCt 
gtgaccagag 
tgatttgaaa 
ttaatttagg 
ctggttgatt 
99ggtgagcc 
gaattccacg 
aactacagtt 
tttttttttt 
cactggttgc 
catggtggtg 
caaggctcag 
aaccaaggta 
cgagtttccc 
agagcagggc 
cagcagcctc 
aggccacagt 
tgaactgcat 
ggcgccgaga 
tcttcaggga 
tataatcagc 
tgtggatctg 
taagtagtga 
gacctgtcta 
gatttgcagt 
tgcaattaac 
ttaagctgga 
gtattgaggt 
ctcttgcatt 
agtagggttc 
tttcgccagt 
cttaatctct 
ttagtaattt 
atgtgaaatt 
tctgatatgc 
ttgcacatgg 
acagtggcct 
cagctttttg 
caagcttgca 
acgaacaacg 
gtgctaacat 
agtgtacaaa 
catttatatg 
ataaaatagg 
tttaaactcc 
cagtctgagc 
gaggacgccc 
gtgatgaact 
ggtgaggaga 
ggactgaaat 
tccaaccatg 
cccaagagca 
aatgctcttc 
actactttac 
gctggaatca 
atcaacctag 
tccttgcaaa 
ccagggaagc 
caatttgagc 
acacctggga 
gctgggtccc 
caggccatga 
ctttgtacct 



ggctacacct 
cagtcttcct 
tgagagagga 
aggaaagctg 
gggatttttc 
gaaagtggtt 
tagtgtaaaa 
agatttgtgc 

ggggtgacgg 

ttactcattg 
tttaaatttt 
tttttaattt 
tgtcttctac 
aagatggaca 
tatgatgata 
gtctggaggg 
tgggaaacgc 
cagaaggcgg 
tagccattgt 
gacccaacag 
gatccagagg 
ccccgctcca 
ctgctaccat 
atgcaagggc 
ataggtgtaa 
tatcaaatct 
ctcatatatc 
attcctttgt 
tttctctcag 
tgtgcccaaa 
aatagaggta 
taggtctgtg 
taaattaatt 
gatttgtaat 
gtcttctgcc 
gttgtggtta 
agagaagatt 
tttcttaagc 
gagcttcttt 
ttgaatacct 
gagtcttaat 
gaataacttt 
gtatatacaa 
ttcctccctc 
aatgaaggaa 
tatttctttt 
atgaccaagg 
ttcccttctg 
agcagggtga 
tgcagaaggc 
ccaagctggg 
tccggtgagg 
atfcccaggca 
tccccatttt 
cagccagtta 
tccctcctgc 
cctcttgact 
ttcagaaaat 
atagcctggt 
ggaagacaag 
tgtcacagaa 
caagaattga 
tggttgcatg 
ttcagggcgg 
gcccactctg 
cagcatcttc 



atctgcacga 
gcagtccctc 
gacaggggag 
tacattcttg 
tcttgttcgg 
aattcccatg 
acccaagcag 
atagagctat 
catacacata 
ctttgaggcc 
cctgacttgc 
cctgagaaga 
aggaaatctg 
acagctggga 
ttgccagctg 
cagggcagcc 
actcctgcac 
gagagctgtg 
ttctgcgtgt 
ggtgagaacc 
ctgacggcag 
tacacctgcc 
ttggagaggc 
tcctgttcca 
actggcacct 
ttcctcatac 
tgcccatttt 
ccattctatc 
tctatcactt 
tccatcttaa 
gcctcttaaa 
atccatctga 
tttctacata 
actacctcca 
ttctgagttg 
agacctgagt 
cgaagttgtc 
ggggtgaagg 
cagcaagatg 
gagcttggtg 
ttcttcattt 
gaaggtcaaa 
gtataaggta 
taatggctca 
aaaaatccaa 
aaatagtatt 
ccactggaga 
tccatcccca 
ggcagccctc 
caagcaggac 
cttggacgtg 
acaggggctc 
gagaatgttc 
gcagataggg 
atggctcacc 
acggcccctc 
gaaaggatgt 
tagttaaagg 
gttcaggacc 
acagacatac 
ggaggctgcc 
gggcaggtga 
gtcctctgga 
ggaggctaga 
ggttcaaatc 
agctgtaaaa 



gacggacctg 
tacggggagg 
ggtttccttg 
actctcctag 
gtggaactgg 
tcatgaagtg 
ataatccagg 
agctgatctg 
aatccagctc 
acagtcccaa 

tggtagtttt 

taggatcaga 
cctccttcaa 
gctcaacatg 
cagccgggtg 
ccactgagac 
tctagagggg 
gaacagtagg 
ttgctctcaa 
tccgcagaac 
aggtggagaa 
acttcttgtc 
gtgtatttga 
acacccacta 
aattgttatt 
acttactagt 
tctattgtgg 
tggattgtaa 
gtctgtttgt 
gttcttcctt 
tttctccccc 
actttatttt 
tcataagaaa 
gcacaacaca 
ctgttcttca 
aatgaaatat 
ccgtgataac 
gaggggctgt 
atctcagcag 
ctaattaact 
gcataatggg 
taagataaca 
atatcactaa 
cctgccaaac 
atgggccatg 
aagtttaaaa 
cccaccagag 
cagcgctgca 
agcaatgccc 
atggcctgca 
gagatcgcca 
cagggtccct 
aaactgggaa 
aaactgacat 
taaaatgtac 
aggctggaat 
ttctgcaatc 
ccatttgatt 
agggaataga 
atcagtcctc 
ttctcaacag 
agccagtgtg 
cttgggacgg 
ctgggtagta 
ccagctctgc 
cagagataat 



6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
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cacaaaatcc accttgtaag gctgttgcag gattaagtga gttttaatgg aacatagtaa 10260 

gccccagtag gaggaggtga ttgtcatgag atgatggcca ggctgggctg cccccatgaa 10320 

gagggaactc aaaggcctgg ttggctttgg aaaccttcca tgaagcgtca ctgatggcag 103 80 

cccggccacg ctgctggact tgctccatgc ttggctcaag cacacacact tttsctgccag 10440 

cttccttggg gctttgggtg aagccactgg agttctgtct gcattcggat acttaagccc 10500 

ccaacccact cactagcatt cagtgagcag ctggcacttg cactgatttt cattatgata 10560 

ctaataatta atggattaat aactcaatca gaattattag tacacatccc ctatccatgt 10620 

gattttaaag tcaatttagc atattgtgct atgtgtactg tggccattga ccttttaaga 10680 

gactgcctcc tcttggcatt taaaatattt aatttaatta cataagtaat gctcgctgct 10740 

gccaatacag aaaataagaa aacaaaagca ttcagagcca ctcaactcag agatagctgt 10800 

cgccagcatt gtggtttgac aggcaccacc acgggcactg tgggttcctt gaaagggcac 10860 

agtcgtaagg ctgtggcaat gccagactga gtggcaggaa accgagccag atgtgtacat 10920 

cctcactcaa aagatgagtg aagaagtgtt ttgagctggg cagaatgagg cagggtagag 10980 

ctttcctggg gaaaaaggtt tgaagtggga agctacaaaa ggggagacag aacagaaagt 11040 

gcagagaggt gtgagggagg agggagtgtt tataaaaagc ccactgtagt ggtatatatg 11100 

tctcatcata aataatatcc acatattttt tgagacagag tctcactctg ttgcccaggc 11160 

tggagtgcag tggtgcaatc ttggctcatt gcaaactcct cttcccggat tcgagtgatt 11220 

ctcctgcctc agcctcccaa gtagctggga ctacaagtgc ctgccaccat gcctggctaa 11280 

tttttgtatt tttagtataa aaagggtttc gccacgttgg ccagcctagt ctcgaattcc 11340 

taggctcaag tgatccacct ccctcagtct cccaaggtgc tgggattaca aaaatgaatt 11400 

gtttataatg ttggtagcaa acatattgga tttccttact atgtgccagt ctccatttta 11460 

ggcactttcc atgtagaact caatcgaact ttacaataat cctttcaggc aggtattatc 11520^^ 

acccccattt tatgctggag aaattgaggc acagaaaggt taagcgacct gcccaagatc 11580^P 

acccagctag taagccgcac agccagattt aagtccaggc agcctggctt gagagtccag 11640 

gctcttaacc acaccatact cttcagcctc cctctgcatt gaatgaaatc agatagatgt 11700 

ctggggtacc atttaaaatc atggtataaa aatgaatctg tttgagtaat aggcaagctt 11760 

tttcctgctc agagacgtct agctaggtag cagcagcttc catcaccact ttggctttgc 11820 

tggcatttcc agccaaggga ccaggaggat gcccccctca tctctacttc ccagttcacc 11880 

aaatttccat ggctgaaagc caaacatggc ataaagtggc attttggatc ctgtaagcta 11940 

acatactcct gtccttttgg gatcccagca agcaggacca acactgatgc tgtctgcagc 12000 

aggtgggtag cttccccctg cctcttaggg cccttcttct ccttcagcat ggaggagaat 12060 

aggtttggtg caaaagcaga ggaactcagc actaaccgta cactctctcc tctccctgca 12120 

ggctgtgtga aggcgtgggc tcagtcaata tctgtaagtt gggtgtgtgt gggcatggct 12180 

ttctggggtt tcagcaatct tagtattaga ttgcatctaa ctgatgacca agcctctgtc 12240 

tatggagtgg tagacaaggg actcctgagc tgatgtctct gggatccaga caaactgatg 123 00 

acctcaaata tttgaccata catggatcca tctggaacac ccatgaagtc taagttaatg 12360 

aatggtgcaa attctgaacc cctatttggt tctgcaaaaa tagcttaagc cttttagtta 1242 0 

cacacagaat actactcaag tggtagcatg ttctagtaag aaaaaaagct aggtttatat 12480 

tctgatcatg tgggtctctg caggcaactt gtaaatctct atctaataat ccaatgtaca 1254 0 

cacagcccct gggccaggtg tctgattccc atccctccat tcccctcacc tctctagcaa 12600 

ccatatttct agatgactcc tctagtgtga tctatcagtt tttgctcaaa gattgtatga 12660 

acatgtctca acatcagcct ggtccaagta gagtttggtg tgcaatttgc taaccacact 12720 

cctgaagaag cagatggcag tgggttcagg tttgcctgta gtttaaaaag actcatgttg 12780 

ctggtccctc agagaaggaa gataattaat cattggttct aaaggattct gggtcataac 1284<^^ 

atgtctgaga tggaagagaa ttctgacctg tgtctctgtc cacatgtgtg agccattccc 1290<^ 

agggtggtgt ggtctgcggg gacctggact ccactgcctc ctgtggctca gggggtgtgg 12960 

ccatcagcag tggtgcactg tgttgcccct ctgcagtggg ggcctgctcc agtgcccaat 13020 

ctgtgcggtt tgcatagagg gggtgggact ctggacggac ctgttctgat gggggcagtg 13080 

tgggttgggg agtggaaatc ttttctcttg catgaccaaa tgtcagttta gtaatttccc 13140 

cttacagaga acccagaact aactctcctg tttccttctt attggagtta acaatttact 13200 

ttctctgtgt aaaccctggg tttgacatta ttttcacttc cattgcttct ccccttccct 13260 

gtggttcagg aatgaaatag aaaggctttt ttttgaatga atttctcggt cattgtcttc 1332 0 

cccaattaac ttcaggccag cagatgttaa attaagtcct gactgtgtgc cagatcctct 13380 

tggaacggag ccagctagag ggtgtggagt atgtgggaac atctctgcgc tttgccctcg 1344 0 

gatgacaggg gctccagatg ctgc 134 64 
<210> 371 

<211> 2508 

<212> DNA 



<213> Homo sapiens 
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<400> 371 
agctctcccc 
tcgcactctg 
atgtcgtgcc 
tgctcagctg 
ggggtgtcct 
tcctgcgggc 
ggctaccgct 
aacgagagcc 
caggaggaga 
gtgcgcttcc 
cagcgctgct 
c 99gaggccg 
caggaggtgc 
gcagagaatg 
gacctggagg 
gaagaggaga 
gacaacagcc 
gatgttgcca 
atgaaggcca 
gagctgaacc 
gccaagctgg 
gcccgctgca 
tgcctgctca 
gccacctaca 
gtgaatgtct 
accaccccag 
gtggcccctg 
age egg t egg 
gcctgcctgc 
atcttctgga 
ccctgcgcct 
gaggcattcc 
gtattattcc 
agctccacct 
agaatcaggg 
cceagcaagc 
cctgcttctg 
ecattategt 
tgctattcag 
cttgctgggt 
cttcctgggt 
ctgtgtttca 
<210> 372 



aecaataaaa 



ccgcccgcac 
gctcctacag 
tggcccccaa 
gctaccgagg 
cccggatagc 
ccgggggcgt 
tcctcacgcc 
aggagcagat 
tggagcagca 
gcgagagcaa 
agtgcgtgga 
tggagggcta 
agtttgtcgt 
ccaatgtgga 
tccgcgttct 
gagacctgaa 
gccgcagccg 
cggtgatcag 
gcatgatcca 
aggctgetgt 
agctggctga 
aggagtacca 
ggcgcctgct 
gtgtcagcag 
ggegecagat 
actcctgtgc 
tccgctttgc 
atcactgcac 
tgttcctaca 
tttcatgeta 
cagcccttca 
cccagctctc 
tctgggtgga 
agaaagtccc 
acttgectgt 
ggctaacctg 

ggggcatcct 

tgaacctttg 
gcaggtgggg 
ctccttctta 
ataaatgecg 



ggaccaggga 
caccttccgc 
gatcagctca 
aactggcaac 
getgaeggge 
tgtaggtggc 
gtgcggaccc 
cctcaacctg 
caagtccctc 
gaacaagctg 
cctggagcca 
ggccgacagc 
caagaagaag 
tctaaagaag 
ggccctggtg 
ccaagcccac 
catggactgc 
ggecgagget 
gcatggggag 
gaggctgacg 
ggctgaggca 
gctggagggc 
ggaggtgatg 
ggagggegag 
ctcccgtggt 
cacttctggc 
cccctgccag 
ctagtagagt 
tgaatggcat 
gagggaaaga 
gggagatgea 
cttaactcct 
agggacaaga 
ggcgccatcc 
tccccagcag 
tcattcccgt 
ttccagccag 
tttgectaca 
tacatttcaa 
actgtccaat 
tgactgtcta 
ctgeaatgea 



ggatcagaga 
tgcctctcag 
ggatgcgggg 
cgctgctgca 
ttcggcagcc 
ttccgagccg 
agccccccat 
gagatcgacc 
aacagcaggt 
ctggagacca 
ctgttcagtg 
gggaggctgg 
tatgaagagg 
gacgtggact 
gaggagtcta 
atctcagaca 
ateategctg 
gagtcctggt 
accctgcgcc 
gecgagattg 
gagcagcagg 
gccctgcaga 
aactccaagc 
gaacacaggc 
ggagtctcct 
ccctcagcca 
cctcgttcct 
catggageca 
gtgaatggaa 
cctacagagg 
tcctagttgt 
acttagctcc 
ccagtccccc 
taaccatcca 
ccccctcctc 
tcatgttttg 
gctcctcatg 
gcccctggtt 
agaagactcc 
gcagagctgg 
tgggtgcatt 
aaaaaaaaaa 



gagcagaagg 
actctgctca 
tcaccaggaa 
tcagcgccgc 
gcagcctctg 
gctcctgcgg 
gcatcactac 
ecaaegcaea 
tcgcggcctt 

a gtggcagtt 

gctacatcga 
cctcagagct 
aggtggccct 
gtgcctacct 
gcttcctgag 
cctcggtcat 
agatcaaggc 
acegtagcaa 
gcaccaagga 
agaatgecaa 
gtgaggegge 
aggecaagea 
tgggcctgga 
tgtgtgaagg 

gtgggggcct 

taggeggcag 
ccagcttcag 
gggcttcctg 
aatgtgtgct 
gaaagaccct 
cctcctggca 
aaaatacctg 
agcgtggtgg 
gccaggccac 
ctggctggga 
cttctctctc 
tgacctcgca 
agggcacttt 
atggctgctc 
egggacagag 
gccttctggg 
aaaaaaaa 



atcctgagcc 
gcctcacacg 
cttcagctcc 
cccctaccga 
caacctgggc 
acgcagcttc 
cgtgtcggtc 
gtgcgtgaag 
catcgacaag 
etaccagaac 
gaetctgegg 
caaccatgtg 
gagagecaca 
geggaaatea 
gcgcctctat 
agtcaagatg 
tcagtatgac 
gtgtgaggag 
ggagatcaac 
gtgccagcgt 
cctcagcgat 
ggacatggcc 
catcgagatc 
tgtgggctct 
ctcctacagc 
catcaeggtg 
ctgcgggagt 
ccaagcacct 
tgcttccaga 

c gggccgctc 

gctgttttca 
tatccaattt 
teagcaegga 
ceacaacceg 
agaatggtcc 
agactgeett 
gttgagaagc 
ggacaggtct 
cagatgcccc 
agttaagcca 
ttgtctcgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2508 



<211> 2404 
<212> DNA 



<213> Homo sapiens 



<400> 372 



60 
120 
180 



gctcacccat tctacacttg etcttttget cccaaccagg aagecatcat gtcttaccac 
S£S£5 J ca *? tct W tcaeegggtg ggeaaettel getcttJSS Igeia?ga?a 

ccacagaacc tgaatcgett ccgggccaac tctgtctcct gttggagtgg glctoqattS lao 

a?SSacta tagtC ^ a9t etcateaect StggtSS ??c2?ccgg H°o 

SSSEfSZ taggctctcg gcccatccac tgtggagtcc gctttggtgc tggctgtggg 300 

gt9at ^ a 9 aggtgttggt ctggggccta gggctgacag ctgtgttS?? 3so 

aafl?^ 9 ^t 99CaS ^cattggc tatggctttg Higccctgg ctttggt?2 «S 

agagttggag gggttggagt cccagcagcc ccatctatca cagctgtgac totalacaaa tin 

25Si«2* CCC * CCtcaa <=ctggagatt gaeeeeaatg ccLgfcgg? gS^algga? tlo 
?SSH-r aaatcaa ? ac ccteaacaae aagtttgcct ccttlatt£a eaagg???gg 

ttcctagagc agcagaataa gctcctagag accaagtaoa acttcchcrp Itt^Z^ZZZ Z** 

tgtatcagga gcaatctgga gccactcttc gagaglgf* SSS ?S! 



BHC03 1 051 



194- 



ttggaggtgc 
gtcctagagg 
aatgagtttg 
gaggccaacg 
gaaatccagt 
agccgtgacc 
gccaggcgca 
gtgacagctg 
acccgcctga 
ttggaggctg 
tgcaagctgg 
ctgtgcgagt 
tacaggcgcc 
atatccgtca 
tccaccctct 

ggggtcctgg 

gacctgctga 
agcgtcccct 
tccagcgtcc 
cccagacagc 
ttcgtgggct 
gcactccccc 
gatgctccat 
tggatcactc 
gtttctcttc 
gcggacaggc 
cagccctgtc 
ggtgggctgc 
tcac 

<210> 373 



tggtcagtga 

gcttcaagaa 

tggctctgaa 

tggataccct 

tgctgcagtc 

tgaaccttga 

gccgggctga 

gccaacactg 

tccagaggct 

cagtggccga 

cagatctgga 

accaggagct 

tgctggaggg 

gcagctcccg 

cccgcggcgg 

cttcctgtgg 

gcactggcac 

gccccctgcc 

gctttgtgtc 

tgctgcccag 

ctgggcttga 

gattattctg 

tagcagttcc 

ctctccctaa 

ctgggttgtt 

gggctttgca 

cactgcaggg 

ctgtgtgcac 



tcaggcccgg 
gaagtatgaa 
gaaggatgtg 
aactcaggaa 
gcacatctca 
tgggatcatt 
tgctgaggcc 
tgacaacctg 
taaggcagag 
ggccgagcag 
gtgtgccctg 
gatgaatgcc 
cgaggagagc 

gggcggcctg 

ggtcaccttc 
ccccagcctg 
aaggagtggc 
cacccagggg 
caccaccacc 
agaagaacca 
cggtctccag 
cctggcccca 
agctaagctg 
accccgagat 
ggtgtaactc 
atgatccccg 
ttgtgtctgc 
tactctttgt 



ctccaggctg 

gaggaagtgg 

gatgcagctt 

attgactttc 

gagacgtcgg 

gctgaggtca 

tggtaccaga 

cgcaacatac 

attgagcacg 

cagggcgagg 

cagcaggcca 

aagctgggcc 

cggctctgtg 

gtgtgcgggc 

tcaggtagca 

ggtggagccc 

tccatgctca 

ggcttcagca 

tcctgccgga 

gctccacggc 

ctcccctttc 

tgtcctcttg 

gcttctccct 

ggcttttctc 

cagcatgcaa 

tactcctgcc 

aagctccaag 

gttcctgggt 



agaggaacca 

tatgtcgggc 

tcatgaacaa 

taaaaacgct 

tcattgtgaa 

aggcccagta 

ccaagtatga 

ggaacgagat 

ccaaggctca 

cgaccctcag 

agcaggacat 

tggacatcga 

aaggtgttgg 

ctgagccttt 

gcagcgtctg 

gggtcgcccc 

tcagcgaggc 

gctgcagcgg 

ccaagtactg 

tcctgcttct 

tgccaggaag 

gaggattttt 

gcctcccaat 

ccagtggctt 

gtccacgagg 

cctgctcgtg 

gaggcctggg 

tttcaataaa 



cctgcaggat 

caatgctgag 

gtctgatctc 

ttacatggag 

gatggacaac 

tgaggaggtg 

agagatgcag 

caacgaactg 

gcgtgccaag 

tgatgccaaa 

ggcgcggcag 

gatcgccacc 

accagtaaac 

ggttgccggc 

tgccaccagt 

ggccactggg 

ctgtgtcccc 

cggccgcagc 

agagcccagc 

gcccccaggg 

ccacccctta 

ctgccatgta 

ttctgtcctt 

ctctccggct 

tggggtgcag 

gtgtgaggcc 

ggttgggtgg 

cttgccaagc 



780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1960 
2040 
2100f 
2160 
2220 
2280 
2340 
2400 
2404 



<211> 2681 

<212> DNA 

<213> Homo sapiens 



<400> 373 

acatctcagt 

atcctgcccc 

gtaccactct 

tgtcatgccc 

tggtaggggc 

tgggaggccc 

gggagccacc 

ggfccccsctg 

gcagatcaag 

gcagaagaac 

gaccaacatc 

tgtgtccggg 

gggctacaag 

tgttgccttg 

cgcagaggca 

cctgctccag 

gctggacgtg 

cagcaaagcc 

tgggaaccac 

gatccagcgg 

tgccatagct 

ggcagggctg 

atatcaggag 

cctgctggag 

gagcagctcc 

cactggcgtc 



cctccatccc 

ggttctttcc 

ttccagccag 

cggatggtca 

ctccgagctc 

cgggtagcct 

tgtgggcctt 

gcacfcggaga 

tgcctcaaca 

aagctgctgg 

gagcccatct 

gaccgcgtga 

aaaaaatacg 

aagaaggacg 

ctcgtgcagg 

tctcagatct 

gacggcatca 

gaagcagagg 

tgtgacaacc 

ctgcagcaag 

gaggcagagc 

gaggaggctc 

gtgatgaact 

ggtgaagagc 

aaaggcgcct 

ctcaggagca 



cctaccagat 

tccagggccg 

gctccaggtg 

cccactatgc 

tgggctgcct 

ccaggtgtgg 

ctgcctgcat 

tagacccgac 

accgtttcgc 

agaccaagtg 

tcgagggcta 

ggctagagtc 

aagaggagct 

tggacacagc 

agatcgactt 

ctgagacctc 

tcgctgagat 

cctggtacca 

tccgcaaccg 

aaaccgagaa 

agcagggcga 

tgcagaaggc 

ccaagctggg 

acaggctgtg 

tcctgtacga 

atgggggctg 



aaaagggggg 

catttctctg 

tggcagtcag 

agtgagcaag 

tggctcacgg 

aggtaccctg 

cacccctgtc 

tgtgcagagg 

atctttcatc 

gaacttcatg 

tatcagcgcc 

agagctctgc 

ctccctgcgt 

cttcctgatg 

cctgaaaagc 

ggtcattgtg 

caaggcgcag 

gtgccggtat 

taagaacgag 

tgtcaaagcc 

ggcggctctc 

caagcaggac 

cctggacatc 

cgaaggcatc 

gccatgtggg 

cagcatcgtg 



aagctgagcc 

cctctctctc 

agtttcagct 

gggccatgcc 

agcctgtgca 

cctggcttcg 

accatcaatg 

gtaaagaggg 

aacaaggtcc 

cagcagcaga 

cttcggcggc 

agcctccagg 

ccctgtgttg 

aaggctgacc 

ctgtatgagg 

aagatggaca 

tatgacgaca 

gaggagctga 

atcctggaaa 

cagcgctgca 

aatgatgcca 

atggcctgcc 

gagatcgcca 

gggcccgtga 

gtcagcacgc 

ggcactggtg 



tgacttgatc 

ctgccatgtc 

catactcggc 

ggcccggggg 

acgtgggctt 

ggtaccgact 

agagcctgct 

atgagaagga 

gtttcctgga 

ggtgctgcca 

agctggactg 

ctgcactgga 

agaatgagtt 

tggagaccaa 

aggagatctg 

acagccggga 

tcgccagccg 

gagtcacagc 

tgaataaact 

aacttgaggg 

agtgcaagct 

tgctcaagga 

cctacaggcg 

atatctcagt 

ctgtcctcag 

aactctatgt 



60 
120 
180 
240 
30« 
360" 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



BHC03 1051 



cccctgcgag 

aggagctggg 

cagagcccct 

ttccaaaacc 

agctgagctg 

gcagaccctc 

tcttaaccgc 

ctgtggatca 

cactgccctt 

tgaaacatgc 

aagccaagtg 

aatcaagagg 

tgagaataaa 

caaagggcaa 

ttttctcaat 

ctgttcccca 

gccccagctg 

ctacctccca 

tgaacatctg 

<210> 374 



ccccaggggc 
ggcagctccc 
gccccatggc 
ccacctccct 
cccctaggac 
agctaggccc 
agaatggcaa 
tcccatggaa 
actttaggct 
acactgcaaa 
ccacgatttc 
tggcctttgg 
tctttctcct 
agggatatgg 
ggaatcagtt 
accctttagc 
acttggaggc 
gaagcccctc 
ggcttctgtt 



tactgagctg 

ccagccacaa 

cctggatgtc 

ttattgactc 

ccctcctttg 

agatggtggg 

taacccccaa 

gtttcctcac 

ctcatgcccc 

tatcaaaagg 

aggatcagtc 

gcagtagatt 

gagctcacat 

aatgtttact 

ccccctccac 

tccttttgct 

cctggctcag 

atgccaagcc 

caaaataaac 



- 195- 

tgggagcggg 

gcattagcat 

acactcccac 

cacattcccc 

cctcatgagt 

gacagacggc 

aaggtcaaat 

aaagtcagct 

taagaaaagt 

agacaccccc 

aggacctgcc 

tactttctca 

tggcaggtga 

ggacagttgg 

ccatcccctt 

ttgagtctga 

gagctcatag 

tgtcccactg 

ctactctgtt 



cggaaatcca 
gatccgagac 
cagggctgaa 
tccagtgttc 
caccttccac 
agagaaagag 
gggcacacca 
cctctcccac 
gaatttcctt 
aaatcccact 
aatgtcctct 
ggtatgttga 
tgtcaggcca 
aaaaccaaaa 
ctgtacatag 
tatgacagtg 
atggaggagg 
gggctccagg 



gcatgacgct 

atccaggaga 

gacaaggatg 

cctccttgag 

tcacctgtgt 

catgcattgc 

cagttctgct 

cacaggctca 

taacaccgcc 

ctaaattcca 

cagcacagag 

acccaaactg 

atgcctagaa 

acagatacca 

ccgccccttc 

atgtggggct 

tgtgggctga 

gcctctgccc 



<211> 11529 

<212> DNA 

<213> Homo sapiens 



1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2681 



<400> 374 

cagagtatat 

atggacatat 

acccagtaat 

tgccagcccc 

accccattgc 

agaggaggaa 

ctgttaggga 

tctcttacct 

ctaaatggag 

atgtccacat 

tggatgtgga 

acgctcagcc 

gcaggagaaa 

ttagggcaca 

cctgtggtca 

acataggtac 

aaggctctga 

atcccataga 

gttcacatat 

ttgctgcctt 

ccactaaaca 

attgttttta 

gtctaaccaa 

cacaagttca 

gatgactatt 

tcacacttgc 

acacagatac 

cagagtcagt 

actatttaat 

cacctcacaa 

ttcacagtgc 

tgattaggta 

tataacatgt 

gaaggctaaa 

ctgttgaaga 

ctgtgcaggg 



ggacaactga 

taaactttct 

atgatggtgg 

tctgactctt 

accaacatct 

gagcctgcca 

tcccatatcc 

ctgactgggg 

cttcctgcaa 

cgacaacatg 

gttaaagaac 

agtgcctggg 

cagcctctgc 

gaggtggaat 

agaatgctga 

aaaaagcatg 

gatagcatac 

atctcactct 

tcactctaat 

ctcccaggac 

taatgagatg 

cttattttta 

aattagtaac 

ggttttagag 

taagtttaaa 

cacatttcag 

agagcaattc 

ggatttagga 

ctctcaaagc 

gagcattgag 

ctggcacata 

cgtgtagtga 

atgcaagcgt 

gcttagcatt 

gtgtgttcct 

ccaggcgatg 



gtccagattc 

gccaaggacc 

cgtgttcttt 

cctgcttctc 

gcttggctct 

cctggactct 

cagagccctc 

caggtgtggt 

ggccaaaaaa 

aagcagcagc 

atccaggacg 

ccagggtcct 

tctccagaaa 

cagaagtaaa 

ggctttagtg 

atacacacag 

caaagatgac 

cacatgaatg 

ccaggcaaag 

cacacacaca 

ctcaaatgct 

tttcttattt 

tggatcatga 

ttgtgctgtc 

ttaattaaaa 

gtgttcaatg 

tgttatcgtg 

tcttggctca 

cccaggttct 

attattaaat 

gtaagttgtt 

tacagacgca 

ggactaacag 

gaaaaggaag 

agtcaatgac 

cttgcccttc 



taggtctaga 
tcctgcccat 
tcttgaatta 
ccctctccaa 
ctatgttcca 
gagatccagg 
tgggaggctc 
ttctggaaca 
ccaccagggc 
tggactgcct 
tggtagggga 
gccctggggc 
tgtccagtct 
cataaaagtg 
aagggctggg 
cattcttcaa 
acaaagtggc 
caactaaatc 
aggctggaga 
cacacacaca 
catgcctgta 
ttctggttct 
gttttggtta 
caacatagta 
ttaaataaaa 
gccacacatg 
gaaagttctc 
accacttaat 
ttatctgtta 
caaaacttgc 
aataagtgtt 
gacaagagaa 
tgtttcaata 
aaggcacagg 
caggcagtgt 
agcagaggga 



gagatcatgt 
ttaatggaat 
tcacacatgc 
acaaaaccat 
ggttccttcc 
ttgtgcagag 
ctgcatggaa 
ggagaacaaa 
caattttgag 
gggtggaggg 
cttcaagaac 
tccaggggag 

a gtggggaaa 

tcaagagaac 
attgtagatg 
actctgcaga 
aactcaggag 
tcactgctaa 
acaaaaaaga 
cacacacaca 
gatacaaatt 
aattattttt 
acactgtttg 
gccaggagcc 
tttaaaattc 
gctaatggat 
ctggatagca 
ggctgaatga 
aaagagacat 
atgtaaggtg 
agctatgttt 
gcaagtgcag 
caagaagact 
ggagggatat 
tcctgtggca 
ctgtgagctg 



cagaaagaaa 
tcaaaacttc 
acctggttgg 
ctgtccccca 
aggctgagga 
cctccctgca 
gctctctcag 
gtcctggaga 
cccatgtttg 
tatgtgaagt 
aagtgagcag 
gtcagagggg 
cggaacacac 
tgggacccac 
atgataccta 
taaggaaagt 
ttaagaatga 
tttgcacagt 
aaggacacat 
cacacacaca 
tcttgatttt 
aattctcttg 
tatactctag 
agtggccaca 
agttcctcag 
cctgtaatga 
ctatagactt 
ccttaagaaa 
caatagtgcc 
cttggtaggg 
attgtattta 
atgggtaatt 
tcctggagga 
ggtagggcat 
tcaaagtggc 
ggccacatgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



BHC 03 1 051 



196 



cagttgtatg 
aaacgatgag 
aaatttcaat 
aagccaggaa 
aatgacaact 
gcagagcagc 
gcacccaagt 
agagatcagt 
ctgcctacag 
aagtctctgc 
gttgtacagt 
tgggttgggg 
ttatgggacc 
agaggggatc 
gatttggggc 
ctctgcagac 
gcccaatcca 
aggtgcactg 
tgcccagctc 
taggtcttac 
gctacctagg 
ccaacaccag 
tgtcctgggc 
cagctccacg 
cacaatgtga 
gcacccctga 
aagttctaag 
gctctcccac 
agtgatagca 
ccaaagtgga 
gagtgcccgg 
gcacagagcc 
tgcagggcag 
caggtcagga 
caggcagaaa 
acttggctca 
tgatgcaatt 
cagcacaatt 
caaatacaac 
acagagataa 
cagtagaaag 
gagcatgtca 
aagtataaat 
cagaggggag 
agtgcctcca 
atgcccaaca 
ggtgggctcc 
ctctacttgt fcctctitcaat 



aagacgagga 
caactcaaag 
cacaatactg 
ggaaatataa 
gttttaaaag 
tggctgatgg 
gtgtgctgat 
cttccttccc 
ggtggggttg 

tatggtcctg 
aaatgcgagt 
ctccctgccc 
atatgcccct 
aagtagactg 
tttgatttgg 
gctaaggctg 
cactaccagc 
agaactcaac 
gagttcctcc 
agcctagaag 
cttgcccagg 
gctgcaaatc 
aaacgggcct 
agcccagaag 
gggttgtcct 
gggcaggtgg 
cctcctctcc 
gggagctgaa 
taaggcccag 
gggctggagg 
aaggcggtga 
cctcagctgg 
acctgcgtat 
gagcggctgg 
ttctgttggg 
tttctgagca 
aactgcaggg 
tgggagcaaa 
cagtgctgag 
gaaaggcaca 
agagaaatga 
tgaaggaatt 
aaggaatggg 
atggctgggg 
gggtgagtgg 
cacacacgtg 
tgattgtttg 



cccacagcca 
tatttgggaa 
gtatgaatca 
gcctactgat 
aattctgatt 
tgatgctgat 
ctgaaaatgg 
gatgtcattt 
tctgggatgc 
cctttctgct 
gccaaattgg 
ttctttcaca 
acatctgtga 
ggagaatgct 
aaagaacagc 
caggccactc 
atcacagcct 
atgcatgcac 



gcaagcctgt 
aaacaagccg 
gtcatcaata 
tctcaaattt 
tagtaagtct 
gctgatgctg 
accattaaca 
aatctccaca 
tatttaaatg 
atttccagct 
gaaacgcaac 
agatattttc 
tactgacagg 
ggctggagtt 
tataaactca 
catccctacc 
cccaaaacat 
acacatacat 



gagttaggtg 
gaaaaccaaa 
ggcaagcaag 
gatcctgctt 
ttccattaca 
tgagatcaaa 
aggttgggtg 
gagcaggagg 
gtaaattgaa 
tccttaggcc 
gaggggctgc 
caagaccata 
caaatactcc 
tgttttgttg 
catgtgtgtc 
ccaccattgc 
accccaaacc 
aggcagagaa 
ccagtggcct 
tttaacagtg 
taatcaggag 
cccccgccgt 
tgggtctcta 
gagtcctggg 
ggtgatgcag 
aggccctggt 
ttctactcca 
ctaattcaca 
tgttctgtgg 
tgccctgatg 
tgccccctcc 
tgtgggctcc 
atgccacttc 
gagatgaaaa 
aatggaccac 
tcctgccctg 
acacaaatca 
gcctggttct 
ggggcctgta 
tcaaatcact 
caatcatagg 
gccaggaagg 
gcccccgggt 
agtggtgaag 
ggtaggcctg 
gaagctggtc 
aattccacct 
gctgacaaca 
gggacaagac 
cagcacaata 
aagagagaaa 
taaagagcat 
gggctgggcc 
gtttgaggtc 
aattaattgt 
aattcatttg 
tgattggcag 
gatttccatt 
tggaaagtag 
cttatttggt 
tcacaacatg 
acatgtgctg 
atgacctccc 
gtgcaagacc 
gcccccacac 
gcactcgcat 



aagaagagac 

ttgctcttcc 

agattttctc 

catgtaagtc 

tagtagagaa 

accacagctg 

gagaatgggt 

cacctccaca 

gaatgagaaa 

acagttcagt 

cggcctctga 

aaaaaaaaaa 

aaggagctga 

ctttgggttt 

ataggagagc 

ctgtccacct 

acttcttgat 

tgagtttgtg 

catacaaggc 

ggaaaactgc 

tgagtcagga 

gtggagtcca 

ggcacccagt 

ccttcctgct 

gagacctttc 

cttcacagag 

tggcctggag 

gacagagggg 

gacatggatg 

gaggagacca 

aacagggggg 

ttccctggtc 

ccctaccaca 

attcagcctt 

ctgtcattaa 

ccaaaattta 

agtggcaatt 

ccgtttgtgt 

agttcttgaa 

gtactatttc 

cagtgggtgg 

catggtgggt 

gtgatccagg 

atagggccag 

gtaaggacag 

acattgttcc 

tacaacccac 

ggg tgatatt 

ctttgaacta 

agaaaataat 

aggaatattt 

cagaatcttc 

cacagtgttg 

acattttgag 

gggttcactg 

aggccagtat 

atacaagact 

tcccatccta 

ccaagttttc 

gtgaagggct 

gtgtgctcct 

ctgctccctc 

taatgttctt 

ctcaatgcca 

acacatgcat 

gcacatacga 



attcagagtt 
cagagaacac 
agatcctata 
atgtgtgatg 
agatcctctg 
tctagcacca 
cacctggaac 
tgcccctggg 
cctttggggg 
gtgcctgaac 
cctttctccc 
aaaaaaaaaa 
gctgtgagaa 

gggttgtgtt 

caggctacag 

ccaccctact 

ccttcccaga 

gtgctcaaga 

cttatttaat 

aattcagaga 

cccaggactc 

gtgtggagac 

caggctcagg 

tggtggccag 

aatgtggcac 

tttggaagtt 

tagacagatg 

gctggttatc 

ctgtctacat 

acttcctgag 

ctgggggctg 

ttctctctgc 

ggagaaaatg 

aaccatctgg 

aatatggatt 

cctgctgctc 

agtgagccct 

cctccacaga 

gcttacagta 

aggccctaca 

gtgacactgg 

gggggcagtg 

aatgccagga 

cagtgtcaag 

gtgatatgcc 

aggacctggg 

agggggataa 

gcctccgaat 

gatctgctgc 

aatttgttga 

ttacttagac 

tagaagtgtt 

catttcaatc 

aaccactgag 

tttccagacc 

tatgatcagc 

caaatttgtc 

tccccgagga 

tggcttcttc 

gtattttctc 

gtgacaacag 

cacgcagatg 

cccagccctt 

gggtctgttc 

acacatgctc 

acgcacatgc 



cattaaaagg 

tgagagaaag 

gagcagcaga 

ggactctgtg 

agaggtcata 

cgagggtggg 

agaagcaaaa 

ggacagacag 

agagggttcc 

agagtgaggg 

tgttcctctc 

agggaaatgt 

aagaaacctc 

tggtggttta 

tggagcattt 

"gctctgacct 

ggtatgaaga 

tgagtggaca 

ttgcacagca 

ggtaaagaca 

caaagtctga 

aggggaagca 

aggacagggc 

ggccagaaca 

cagagctgat 

ctactccctc 

ctctgtcctt 

cctggaaggg 

gaacaaagtg 

cactttctat 

ggtcttagag 

tccttatccc 

aggttccata 

atcaagcatg 

tccatagaac 

ttcctagagc 

caggtgcttc 

gctgcaggca 

tagtgcagga 

gaggcaatga 

acaagaaact 

ctccagcaag 

ggaggacagt 

gtggggtgta 

catagggtca 

gcttcagctc 

ccagccccct 

tctgaactag 

caggaaaaac 

aacgggaaat 

ttctgctttg 

gctaaacacc 

aagtttccag 

ttagatgatc 

atcttatctg 

cttgctgaga 

agttgggtgc 

gcagggagga 

tgactcagtt 

agaacatgag 

agtggcttct 

agctgatgac 

tcagcctaca 

cagggggcag 

acacatgcac 

acacttgcac 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480^ 

354d9 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

480(> 

486^ 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 
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acacacacac 
accacaaaag 
gatggaaaac 
tgaagacatc 
acatcaaggt 
taaaagccat 

tggggctcac 

aggcaggagg 
atcagagaga 
gagggagggt 
agaagaaagg 
acagactggg 
atttgcaatg 
aatttgggag 
gaagggaaag 
gtgagggaca 
agggaggctc 
tccctggcct 
ccctcctagt 
ccaagatgga 
cttgtagaag 
ggtggacttg 
ccttccagga 
ggattgaaga 
tgctttactt 
tgtggtgact 
aggtcaggag 
caaaaattag 
caggggaatc 
actccaacct 
aaaaggacct 
acagaaatca 
cttcagggaa 
catgaacaat 
catcgcctgc 
ggagatagcc 
gccaatgtat 
gggcttgaag 
gaccacgaca 
agctcccagg 
gcagcccctc 
gggacgctaa 
caaggctcag 
acagctgcct 
tcacagatgc 
gtgcctgata 
cttaacctct 
ggtgtcagag 
acaatggaag 
ggcagggatg 
tccccatgag 
actttctcct 
ccctcacccc 
tcgtgggtcc 
acacaaataa 
tatgaattca 
agcagcagtg 
catcataggc 
gacccatgtg 
aaccaaccca 
atcaatgaga 
cccgttgctg 
gggttctggt 
tctatggctc 
aagacagggg 
tfc 99gtatgc 



ggacacacat 
ctggctcagc 
aactgcaacc 
gccaacaaca 
caagttcatt 
gtcctctcag 
cctatgttac 
tggtcagtcc 
aaagggacag 
ggggacaaga 
cggaactggc 
attttccaga 
ccgaggaaaa 
ggtgagatgc 
agcacagcca 
gagtcaagag 
tcagacgcac 
ggctgtccct 
acaaggagct 
gatctctgag 
caggtagggc 
tgcctgggcc 
atataaatat 
atgcgagaaa 
actcgcaatc 
cacgcctgta 
ttcgagacca 
ccaggcatgg 
gcttgaacct 
Srggcgacaga 
cagaacatca 
atcttagttt 
ttctttcata 
gtcacacaca 
tgatgccaag 
ctgcagaagg 
gtcaagctag 
agcaggtggg 
gcaatgccac 
tcagaaacag 
ccacagctgt 
gcaagaccct 
gggctctgga 
cttctcattt 
acttgagctt 
catgtaagca 
cagcaaacaa 
ccgtcagtac 
ctgggagcat 
tcacaaagac 
gtcaagagaa 
cactctggct 
cagggctggt 
ccagtctgaa 
gaggctatta 
acaaatagaa 
acctcacaga 
ggcattaatg 
ggtcagatca 
tcagaggggc 
cgtttccaaa 
aaaatattga 
actgggggag 
ccttccagaa 
aatgaaccca 
cacttgatct 



gcacatgcac 
ttcaggccca 
tggacctgga 
gcagggctga 
aaacaagcac 
atgtcccagt 
ctgggggaag 
ctctgccaag 
ctggacagtt 
ggaaagttta 

tggggcccag 

tgagagcaaa 
cacgaacaaa 
atggtgtcga 
cagctgtcca 
aaaatgcaga 
atgaaagcct 
gcccactgca 
acagcactct 
ataaactggt 
tgagctccca 
tcactcagcc 
gcattcacct 
ggcactttct 
cagaaaagtt 
atcccagcac 
gcctgacgaa 
tggtgcgcgc 
gggagatgga 
gtgaaactcc 
ttttctccat 
taaagatgat 
ttttcaactt 
ttctcatcgc 
cagcatggga 
ccaagcagga 
ccccggacat 
ccacccctcc 
agaggcccca 
ggttaaaatg 
tggacccagg 
ttcttcccaa 
attgccctgc 
ctctctgtat 
cccaaggcta 
ctctgtgctg 
aatgactctt 
ctgtgagtca 
cactcttgag 
cctcttggag 
ctcaaaaggg 
taatctccac 
cccagaagat 
tgcctgattt 
aacagattaa 
cagtgactga 
gtgacccacc 
ggagcattga 
cattagagag 
aggaatgcct 
tgtttacctg 
agcagaagct 
gctcgtccag 
tacaaggcct 
gcacaggagc 
ctttgcacct 



actctgcctc 
gatctctgaa 
tagcatcatc 
ggctgagttc 
aaaaaaccga 
ctgatagggt 
catgtcagga 
aggaagatgg 
ccagccatca 
accccttgtg 
gggctggagg 
cattctgccc 
ggaaaagtga 
gggaaaggga 
gctgtagggt 
tgaatttgat 
atggggctgt 
cgcatccccc 
gctagcctgc 
aatgcagagg 
ggtgtccttc 
cctacagaga 
agtgcaatgg 
tctggaagag 
cagcattgaa 
tttgggaggc 
catggtgaaa 
ctataatccc 
ggttgcagtg 
atctcaaaaa 
tctggctaag 
gcagctccaa 
aaagttcata 
cacctggcag 
agctggcact 
catggcgcag 
tgagatcgcc 
cctcccacta 
tgagcgccac 
caattgctcc 
gcaggggtga 
gtcctgtgag 
aagctgacgg 
ttgtgactcg 
gcaagctgag 
ataactttaa 
tctctcccta 
atgtcttgga 
ggtctggtgg 
gctctatggg 
gcacattcca 
aaaaactgtg 
cccagtgaca 
ctcagctagt 
atgtaaaggc 
tctggcgtgg 
agggctctgg 
gtcctggtca 
gctggcctgc 
caccaggaaa 
gcttgcaaag 
gaacaattgt 
gtggccttca 
cctcttccat 
taaaagcctt 
cattttttcc 



agctctctca 
acctccatgg 
attgaagtca 
tggtactaaa 
gatttcaaag 
caagggatct 
cctggggaac 
acacacctcc 
gtgccaacag 
gaggaggctg 
gcaaagtctt 
tgggagtcgt 
ttggcaccta 
agaaagacca 
gaagccagtg 
tgggccagag 
gggaagccac 
atccgaggtc 
acgggatgac 
ctgcactctg 
catccatgct 
gaagcctccc 
ttaatattgt 
taaatggcaa 
aaggacctca 
tgaagagggc 
tcccgtctcc 
agctactcag 
agcagaaatc 
aaaaaaaaaa 
tctcaaacta 
cctcctttgc 
gaggatctta 
tgtgccatgc 
aagaacaagc 
cagctgtgca 
acctacagga 
ccacccaaac 
agtgtcagcc 
cttaaaggcg 
gggtcaaagg 
aaatgcaatc 
tctcgcacta 
ttctctctcc 
gggagagctt 
atacaaagaa 
gcggtcacag 
atggtagaaa 
gtcaaagatg 
gactcttctt 
tgtcactaag 
gcgattaatc 
ggacaagact 
gctgtcagga 
ctatatttag 
cctcaactca 
ggctgtgtgt 
tgggagtaga 
caggattggg 
aacggacgcc 
atgcatccaa 
atcttggcaa 
ggcctctttc 
ccaatgtgac 
ggcttctggt 
tctacaaaat 



tttggaccca 
tccttctgtt 
aggcccagca 
ccaaagtgag 
gcaaacaaga 
gagttcagct 
agccagataa 
aatgccctgt 
gaggaacaga 
gaggatccag 
aatctaagtc 
tgggatgata 
cctgatgctg 
ggtataggaa 
ggcatagaaa 
gcggcccagc 
agccttctct 
tgcttctcct 
ctccacacca 
agattgataa 
cctgcctcca 
agatgagtgg 
caacttgatt 
aactttggct 
gagggctggg 
ggatcacctg 
actaaaaata 
gaggctgagg 
gtgccagtgc 
agaaagaaag 
cccaaggtgg 
attaaatcca 
ctgcatcctc 
cccaggccac 
tggtggagct 
ggtaccagga 
agctgctgga 
agatgcacca 
ctccagatga 
tcctgcagtt 
ggtcaaggaa 
atgcttggga 
cctctctacc 
tggattttcc 
gccaagcaca 
cccagggccc 
actcactgaa 
agtggggggc 
agcaggcagg 
gtgcccttct 
gagctcaaca 
tgcaggtgac 
attcaaacaa 
actcaaacta 
aattgaagca 
caggagggag 
aaaagctcag 
atccccttga 
gagagcctgg 
aagaggggaa 
gtagggagtt 
tgtcattgag 
gaaaaggaag 
agccgcctac 
gggcgtggac 
gggggttatt 



6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
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gtagtaatgg 

ttgagtatgg 

agtccacgcc 

agggttgcct 

gggcgtcgtg 

aggtctcttc 

gtggccacag 

tgaccagcag 

agagtcacag 

cagctccaag 

aggagcagga 

acagaggtac 

ttccaaccaa 

agaagcttcc 

cagcaaccac 

gtgccccaaa 

gatggaacag 

cctcctcacc 

ccagcaggat 

tcataatata 

aatgaatgag 

acctgctgtt 

tgtgagggtg 

cctattttaa 

aggctcaga 

<210> 375 



tgtcctctca 
cgatgccctg 
caccctctct 
ggaacaaggc 
ggaaatggga 
ccctggggcc 
cagtggcctc 
tcacagcagc 
cagccatgca 
gcttgatatt 
agtggcttat 
cccacctgct 
tgttgactca 
agagtagggg 
aagacaaggc 
gagactaagt 
ttttcccact 
tcctttcttc 
atttgagtga 
atggaatgat 
agttcatggc 
cctaaacaga 
tgggaaggcc 
ttctcctgca 



gccccattga 
tgcagaggtt 
cacactggct 
attgcgggag 
agaacctgcc 
acaggctcca 
tgccacaaca 
tccaacatat 
cctgcctcca 
tactgacgtc 
gaggtcatcc 
cccgatctct 
actgctctgt 
gtgcagtagg 
ctcttatgca 
gcagtgtggg 
caggccagga 
cccctgccca 
gaaatcaaca 
gttatcaaga 
caaatttctg 
aatgccctga 
acttggccca 
ttaggaactg 



gatccacatg 
catgagaagg 
caggcgttcc 
gggagcaggc 
aacatatcct 
gtgggggcag 
acaggagcag 
tcattgtctc 
ctgccactcc 
ctacacacaa 
tgtccatccc 
gcagggtcct 
gccaggacct 
gtctttcccc 
gtccttccca 
cccaagtttc 
cacatcttgc 
ggccccagcc 
cccttcaaaa 
actcaaaagc 
tcaagtctgc 
tgacagagga 
aaatgtttgc 
ccctgtgaga 



gtcacagctg 
tagcgtgtga 
ccgaggctct 
atggggatca 
gagccctctc 
cgtcctctgt 
tggcatcagc 
caagagatgg 
cctcaagccc 
aatcaacagc 
tctgcctcca 
gcagggtcca 
gggctagcac 
tcctgtctaa 
ggacctcaga 
cacacacctt 
acaccctgcc 
aagtcccctg 
fcatgatcaac 
cctccatggc 
tttatttaag 
gtgggaggga 
aagaaataca 
ttcctgcaaa 



tgaatgtgcc 
cggggtggtg 
cagggaaggg 
cacccaggaa 
tccacagctg 
gtgggtgggg 
tgtccagctg 
cccaccaagg 
tcaagcttcc 
aggaggtaca 
gaaggtcccc 
caggcatcca 
tttgccctta 
gtcagaccca 
actgaccacg 
tctcattgca 
acacatgcca 
acaactccca 
tcttgcttat 
ccagggctgc 
tcaatattag 
agcaggctca 
ccaggaaaag 
ataatcctcc 



10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460t 

11520 

11529 



<211> 2218 

<212> DNA 

<213> Homo sapiens 



<400> 375 
catggccagc 
cagctcagcc 
ctccaggggc 
tctgtatggc 
tggcggctat 
tggtttcggt 
tggctttggg 
caaccagagt 
ggccgaggag 
ggtgcggttc 
gcagggcacc 



aaacatgcag 
cacagcagca 
caaggttgaa 
cttgtatgat 
atccatggac 
atatgaggag 
cgaggagctg 
gattgctgag 
gcagtgtgcc 
caaggatgct 
cctggcccgg 
ggagatcgcc 
cgttggacaa 
cagcggtgtc 
tcttggcgtt 
ctctgttgga 
gagctacaag 
ctggcagcag 



acatccacca 
aggctccctg 
agtggtggcc 
ctggggggct 
ggcagcagag 
ggtggagccg 
ggccctggct 
ctcctgactc 
cgtgagcaga 
ctagagcagc 
aagactgtga 
cagctggaca 
gacctggtgg 
gagaatgaat 
ctgcaagcca 
gcagagctgt 
aacaaccgca 
attgctcaga 
caggtcacag 
atcaaccgca 
aacctacagg 
aagaacaagc 
ctgctgaagg 
acctaccgca 
gtcaacatct 
ggcagtggct 
ggaggcggct 
ggcggcagtt 
cactgaagtc 
agccctctcc 



ccatcaggag 
gggtcagccg 
tgggtggcgc 
ccaagaggat 
ccggaggcag 
gcattggctt 
tccctgtgtg 
ccctcaacct 
tcaagaccct 
agaacaaggt 
ggcagaacct 
gcatcgtggg 
aggacctcaa 
ttgtgactct 
aggcagacac 
cccagatgca 
acctggacct 
ggagcagggc 
caggcagaca 
tgatccagag 
ctgccattgc 
tggaagggct 
agtaccagga 
agctgctgga 
ctgtagtgca 
taggcctggg 
ttagttccag 
ccaccatcaa 
gtgccgccag 
tcaggttgct 



ccacagcagc 
ctctggcttc 
atgtggagga 
ctccattgga 
ctatggcttt 
tggtctgggt 
cccccctgga 
gcaaattgac 
caacaacaag 
tctggacacc 
ggagccgttg 
ggaacggggt 
gaacaaatat 
gaagaaggat 
tcttacagat 
gacccacatc 
ggacagcatc 
tgaggctgag 

tggggacgac 

gctgagatct 
tgatgctgag 
ggaggatgcc 
gctgatgaac 
gggcgaggag 
gtccaccgtc 
tggaggaagc 
cagcggcaga 
gtacaccacc 
ctctcagtcc 
tgtcctcccc 



agccgccggg 
agcagcatct 
gctggctttg 

gggggcagct 
ggtggcgccg 

ggtggagccg 
ggcatccaag 
cccgccatcc 
tttgcctcct 
aagtggaccc 
ttcgagcagt 
cgtctggact 
gaggatgaaa 
gtggatgctg 
gagatcaact 
tcagacacat 
atcgctgagg 
tcctggtacc 
ctgcgcaaca 
gagatcgacc 
cagcgtgggg 
ctgcagaagg 
gtcaagctgg 
tgcaggctga 
tccagtggct 
agctactcct 
gccactgggg 
acctcctcct 
cacagctctc 
tggcctccag 



gtttcagtgc 
ccgtgtcccg 
gcagccgcag 
gtgccatcag 
ggagtggatt 
gccttgctgg 
aggtcactgt 
agcgggtgcg 
tcatcgacaa 
tgctgcagga 
acatcaacaa 
cggagctgag 
tcaacaagcg 
cctacatgaa 
tcctgagagc 
ccgtggtgct 
tcaaggccca 
agacaaagta 
ccaagcagga 
acgtcaagaa 
agatggccct 
ccaagcagga 
ccctggacgt 
atggcgaagg 
atggcggtgc 
atggcagtgg 
gtggcctcag 
ccagcaggaa 
aggcccctct 
tctcccctgc 



60 

120 

180 

240 

300 

360 

420 

480 

54(A 

600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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cctcccgggt agagctggga tgccctcact tttcttctca tcaatacctg ttccactgag i860 

ctcctgttgc ttaccatcaa gtcaacagtt atcagcactc agacatgcga atgtcctttt 1920 

fcagttcccgt attattacag gtatctgagt ctgccataat tctgagaaga aaaatgacct 1980 

atatccccat aagaactgaa actcagtcta ggtccagctg cagatgagga gtcctctctt 2 040 

taattgctaa ccatcctgcc cattatagct acactcagga gttctcatct gacaagtcag 2100 

ttgtcctgat cttctcttgc agtgtccctg aatggcaagt gatgtacctt ctgatgcagt 2160 

ctgcattcct gcactgcttt ctctgctctc tttgccttct tttgttctgt tgaataaa 2218 
<210> 376 

<211> 1986 

<212> DNA 

<213> Homo sapiens 



<400> 376 

cgcgccaacg ctcgccacag ccctctcatc tcctggaacc atggccagca catccaccac 60 

catcaggagc cacagcagca gccgccgggg tttcagtgcc aactcagcca ggctccctgg 120 

ggtcagccgc tctggcttca gcagcatctc cgtgtcccgc tccaggggca gtggtggcct 180 

gggtggtgca tgtggaggag ctggctttgg cagccgcagc ttatatggcc tggggggatc 240 

caagaggatc tccattggag ggggcagctg tgccatcagt ggcggctatg gcagcagagc 300 

cagaggcagc tatggctttg gtggcgccgg gagtggattt ggtttcggtg gtggagccgg 360 

cattggcttt gatctgggtg gtggagccgg ccttgctggt ggctttgggg gccctggctt 420 

ccctgtgtgc ccccctggag gcatccaaga ggtcactgtc aaccagagtc tcctgactcc 4 80 

cctcaacctg caaattgacc ccgccatcca gcgggtgcgg gccgaggagc gtgagcagat 54 0 

caagaccctc aacaacaagt ttgcctcctt catcgacaag gtgcggttcc tagagcagca 600 

gaacaaggtt ctggacacca agtggaccct gctgcaggag cagggcacca agactgtgag 660 

gcagaacctg gagccgttgt tcgagcagta catcaacaac ctcaggaggc agctggacaa 720 

catcgtgggg gaacggggcc gcctggactc ggagctgaga aacatgcagg acctggtgga 780 

ggacctcaag aacaaatatg aggatgaaat caacaagcgc acagcagcag agaatgaatt 840 

tgtgactctg aagaaggatg tggatgctgc ctacatgaac aaggttgaac tgcaagccaa 900 

ggcagacact ctcacagatg agatcaactt cctgagagcc ttgtatgatg cagagctgtc 960 

ccagatgcag acccacatct cagacacatc cgtggtgcta tccatggaca acaaccgcaa 102 0 

cctggacctg gacagcatca tcgctgaggt caaggcccaa tacgaggaga ttgctcagag 1080 

gagccgggct gaggctgagt cctggtacca gaccaagtac gaggagctgc aggtcacagc 114 o 

aggcagacat ggggacgacc tgcgcaacac caagcaggag attgctgaga tcaaccgcat 1200 

gatccagagg ctgagatctg agatcgacca tgtcaagaag cagtgtgcca gcctgcaggc 1260 

tgccattgct gatgctgagc agcgtgggga gatggccctc aaggatgcta agaacaagct 1320 

ggaagggctg gaggatgccc tgcagaaggc caagcaggac ctggcccggc tgctgaagga 1380 

gtaccaggag ctgatgaatg tcaagctggc cctggacgtg gagatcgcca cctaccgcaa 1440 

gctgctggag ggcgaggagt gcaggctgaa tggcgaaggc attggacaag tcaacgtctc 1500 

tgtagtacag tccaccatct ccagtggcta tggcggtgcc agtggtgtcg gcagtggctt 1560 

aggcctgggt ggaggaagca gctactccta tggcagtggt cttggcattg gaggtggctt 1620 

cagttccagc agtggcagag ccattggggg tggcctcagc tctgttggag gcggcagttc 1680 

caccatcaag tacaccacca cctcctcctc cagcaggaag agctacaagc actaaagtgc 1740 

tgcctccagc tctcggtccc acagtcctca ggcccttctc tggctgcaga gccgtcttct 1800 

caggttgcct gtcgtctcct ggcctctagt cttccctgct ctccgaggta gagctgggta i860 

tggatgctta gtgccctcac ttctctctgt ctatacctgc cccatctgag cacccattgc 192 0 

tcaccatcag atcaaccttt gattttacat cataatgtat tcaccactgg agcttcactt 1980 
tgttac " 

<210> 377 1986 

<211> 2222 

<212> DNA 

<213> Homo sapiens 



<400> 377 

catggccagc acatccacca ccatcaggag ccacagcagc agccgccggg gtttcagtgc 60 

caactcagcc aggctccctg gggtcagccg ctctggcttc agcagcatct ccgtgtcccg 120 

ctccaggggc agtggtggcc tgggtggtgc atgtggagga gctggctttg gcagccgcag 180 
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cttatatggc 

tggcggctat 

tggtttcggt 

tggctttggg 

caaccagagt 

ggccgaggag 

ggtgcggttc 

gcagggcacc 

cctcaggagg 

aggcatgcag 

cacagcagca 

caaggttgaa 

cttgtatgat 

gtccatggac 

atatgaggag 

cgaggagctg 

gattgctgag 

gcagtgcgcc 

caaggatgcc 

cctggcccgg 

ggagatcgcc 

cgttggacaa 

cagtggtgtc 

tcttggcgtt 

ctctgttgga 

gagctataag 

tggctgcaga 

gtcccaggta 

ccatctgagc 

caccactgga 

gaggctgagc 

gcttatttca 

gtgttgtgaa 

tacaaagcct 

aa 

<210> 378 



ctggggggat 
ggcagcagag 
ggtggagccg 
ggccctggct 
ctcctgactc 
cgtgagcaga 
ctggagcagc 
aagactgtga 
cagctggaca 
gacctggtgg 
gagaatgaat 
ctgcaagcca 
gcagagctgt 
aacaaccgca 
attgctcaga 
caggtcacag 
atcaaccgca 
aacctgcagg 
aagaacaagc 
ctgctgaagg 
acctaccgca 
gtcaacatct 
ggcagtggct 
ggaggtggct 
ggcggcagtt 
cactaagtgc 
gccctctcct 
gagctgggga 
acccattgct 
gcttcactgt 
attataagaa 
gaacaacttc 
cccccaccca 
gtatctctgt 



ccaagaggat 

ccagagcgag 

gcattggctt 

tccctgtgtg 

ccctcaacct 

tcaagaccct 

agaacaaggt 

ggcagaacct 

gcattgtcgg 

aggacttcaa 

ttgtgactct 

aggcagacac 

cccagatgca 

acctggacct 

gaagccgggc 

caggcagaca 

tgatccagag 

ccgccattgc 

tggaagggct 

agtaccagga 

agctgctgga 

ctgtggtgca 

taggcctggg 

tcagttccag 

ccaccatcaa 

gtctgctagc 

caggttgcct 

tgaatgctta 

caccatcaga 

tactaaatta 

aatgacctct 

cacttacttt 

ggcagtatcc 

gatgatttct 



ctccattgga 
ctatggcttt 
tgatctgggt 
cccccctgga 
gcaaattgac 
caacaacaag 
tctggaaaca 
ggagccgttg 
ggaacggggc 
gaacaaatat 
gaagaaggat 
tctcacagac 
gacccacatc 
ggacagcatc 
tgaggctgag 
tggggacgac 
gctgagatct 
tgatgctgag 
ggaggatgcc 
gctgatgaat 
gggtgaggag 
gtccaccgtc 
tggaggaagc 
cagtggcaga 
gtacaccacc 
tctcggtccc 
gtcctctcct 
gtgccctcac 
tcaacctctg 
ttaatttctt 
gctccttttc 
ccactggctc 
atgaaagcac 
gtgctcttca 



gggggcagct 

ggtggcgccg 

ggtggagccg 

ggcatccaag 

cccgccatcc 

tttgcctcct 

aagtggaccc 

ttcgagcagt 

cgcctggact 

gaggatgaaa 

gtggatgctg 

gagatcaact 

tcagacacat 

atcgctgagg 

tcctggtacc 

ctgcgcaaca 

gagatcgacc 

cagcgtgggg 

ctgcagaagg 

gtcaagctgg 

tgcaggctga 

tccagtggct 

agctactcct 

gccattgggg 

acctcctcct 

acagtcctca 

ggcctccagt 

ttcttctctc 

attttacatc 

gcctccagtg 

attgcagaaa 

tcaaactctc 



gtgccatcag 
ggagtggatt 
gccttgctgg 
aggtcactgt 
agcgggtgcg 
tcatcgacaa 
tgctgcagga 
acatcaacaa 
cagagctcag 
tcaacaagcg 
cctacatgaa 
tcctgagagc 
ctgtggtgct 
tcaaggccca 
agaccaagta 
ccaagcagga 
acgtcaagaa 
agatggccct 
ccaagcagga 
ccctggacgt 
atggcgaagg 
atggcggtgc 
atggcagtgg 
gtggcctcag 
ccagcaggaa 
ggcccctctc 
ctcccctgct 
tctctctata 
atgatgtaat 
ttctatctct 
attgccaggg 
taacttataa 



aagtgactag tcctatgatg 
ctctttgcaa ttgctaaata 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500^ 

156d9 

1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2222 



<211> 2270 

<212> DNA 

<213> Homo sapiens 




<400> 378 

ctcctccagc ctctcacact ctcctcagct ctctcatctc ctggaaccat ggccagcaca 60 

tccaccacca tcaggagcca cagcagcagc cgccggggtt tcagtgccaa ctcagccagg 120 

ctccctgggg tcagccgctc tggcttcagc agcgtctccg tgtcccgctc caggggcagt 180 

ggtggcctgg gtggtgcatg tggaggagct ggctttggca gccgcagtct gtatggcctg 240 

gggggctcca agaggatctc cattggaggg ggcagctgtg ccatcagtgg cggctatggc 300 

agcagagccg gaggcagcta tggctttggt ggcgccggga gtggatttgg tttcggtggt 360 

ggagccggca ttggctttgg tctgggtggt ggagccggcc ttgctggtgg ctttgggggc 420 

cctggcttcc ctgtgtgccc ccctggaggc atccaagagg tcaccgtcaa ccagagtctc 480 

ctgactcccc tcaacctgca aatcgatccc accatccagc gggtgcgggc tgaggagcgt 540 

gaacagatca agaccctcaa caacaagttt gcctccttca tcgacaaggt gcggttcctg 600 

gagcagcaga acaaggttct ggaaacaaag tggaccctgc tgcaggagca gggcaccaag 660 

actgtgaggc agaacctgga gccgttgttc gagcagtaca tcaacaacct caggaggcag 720 

ctggacagca ttgtcgggga acggggccgc ctggactcag agctcagagg catgcaggac 780 

ctggtggagg acttcaagaa caaatatgag gatgaaatca acaagcgcac agcagcagag 840 

aatgaatttg tgactctgaa gaaggatgtg gatgctgcct acatgaacaa ggttgaactg 900 

caagccaagg cagacactct cacagacgag atcaacttcc tgagagcctt gtatgatgca 960 

gagctgtccc agatgcagac ccacatctca gacacatctg tggtgctgtc catggacaac 1020 

aaccgcaacc tggacctgga cagcatcatc gctgaggtca aggcccaata tgaggagatt 1080 

gctcagagaa gccgggctga ggctgagtcc tggtaccaga ccaagtacga ggagctgcag 1140 

gtcacagcag gcagacatgg ggacgacctg cgcaacacca agcaggagat tgctgagatc 1200 
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aaccgcatga tccagaggct gagatctgag atcgaccacg tcaagaagca gtgcgccaac 1260 

ctgcaggccg ccattgctga tgctgagcag cgtggggaga tggccctcaa ggatgccaag 132 0 

aacaagctgg aagggctgga ggatgccctg cagaaggcca agcaggacct ggcccggctg 13 80 

ctgaaggagt accaggagct gatgaatgtc aagctggccc tggacgtgga gatcgccacc 1440 

taccgcaagc tgctggaggg tgaggagtgc aggctgaatg gcgaaggcgt tggacaagtc 1500 

aacatctctg tggtgcagtc caccgtctcc agtggctatg gcggtgccag tggtgtcggc 1560 

agtggcttag gcctgggtgg aggaagcagc tactcctatg gcagtggtct tggcgttgga 1620 

ggtggcttca gttccagcag tggcagagcc attgggggtg gcctcagctc tgttggaggc 1680 

ggcagttcca ccatcaagta caccaccacc tcctcctcca gcaggaagag ctataagcac 174 o 

taaagtgcgt ctgctagctc tcggtcccac agtcctcagg cccctctctg gctgcagagc 1800 

cctctcctca ggttgcctgt cctctcctgg cctccagtct cccctgctgt cccaggtaga i860 

gctggggatg aatgcttagt gccctcactt cttctctctc tctctatacc atctgagcac 1920 

ccattgctca ccatcagatc aacctctgat tttacatcat gatgtaatca ccactggagc 1980 

ttcactgtta ctaaattatt aatttcttgc ctccagtgtt ctatctctga ggctgagcat 2040 

tataagaaaa tgacctctgc tccttttcat tgcagaaaat tgccaggggc ttatttcaga 2100 

acaacttcca cttactttcc actggctctc aaactctcta acttataagt gttgtgaacc 2160 

cccacccagg cagtatccat gaaagcacaa gtgactagtc ctatgatgta caaagcctgt 2220 

atctctgtga tgatttctgt gctcttcact gtttgcaatt gctaaataaa " 2270 

<210> 379 

<211> 2301 

<212> DNA 

<213> Homo sapiens 



<400> 379 

tcgacagctc 

tgggtaacag 

ccactagtgc 

tgtgtccttc 

tgtctcccgc 

cggcagggtc 

caacctgggg 

gtttggtgct 

tttcggcggt 

cttcggtggc 

ccagagtctc 

cgaggagcgc 

gcggttcctg 

gggcaccaag 

caggaggcag 

catgcaggac 

cactgctgag 

ggtggagctg 

ctttgatgcg 

catggacaac 

tgaggagatt 

ggagctgcag 

cacagagatg 

gtgcgccaat 

ggatgccagg 

ggcccggctg 

gatcgccact 

tggaccagtc 

tggctatggc 

aggtagcagt 

cagtgtgggg 

cagtggcggg 

gagcttcaag 

catggagatt 

agtctagacc 

agcccctggt 

cacagcatgg 



tctcgcccag 
agccaccttc 
ctggttctct 
cggagcgggg 
accagcttca 
agccttgcgg 
ggctccaaga 
ggtgctggag 
ggagctggtg 
cctggctttc 
ctgactcccc 
gagcagatca 
gagcagcaga 
actgtgaggc 
ctggacagca 
ctggtggaag 
aatgagtttg 
gaggccaagg 
gagctgtccc 
aaccgcaacc 
gccaaccgca 
cagacagctg 
aaccggatga 
ctgcagaacg 
aacaagctgg 
ctgcgtgagt 
taccgcaagc 
aacatctctg 
ggtggcctcg 
ggaagctact 
ggctctggct 
ggtagcagct 
agctaagaac 
gcctcttcta 
aagccaattg 
ctcccgtgcc 
cccctgctga 



cccagttctg 

tgcgtcctgc 

tgctccacca 

gcagtcgtag 

cctccgtgtc 

gtgcttgtgg 

ggatatccat 

gcggctatgg 

gtggctttgg 

ctgtctgccc 

tcaacctgca 

agaccctcaa 

acaaggttct 

agaacctgga 

tcgtggggga 

acttcaagaa 

tgatgctgaa 

ttgatgcact 

agatgcagac 

tggacctgga 

gccggacaga 

gccggcatgg 

tccagaggct 

ccattgcgga 

ccgagctgga 

accaggagct 

tgctggaggg 

ttgtcacaag 

gtggaggtct 

actccagcag 

tcagtgcaag 

ccagcgtcaa 

ctgctgcaag 

ggcagttgct 

cagaaccaca 

gcagttctat 

cacgagaacc 



gaagggataa 
tgagctctgt 
ggaacaagcc 
cttcagcacc 
ccggtccggg 
agtgggtggc 
cagcactaga 
ctttggaggt 
gctcggtggc 
tcctggaggt 
aatcgacccc 
caataagttt 
ggacaccaag 
gccgttgttc 

ac ggggccgc 

caagtatgag 
gaaggatgta 
gatggatgag 
gcatgtctct 
tagcatcatc 
agccgagtcc 
cgatgacctc 
gagagccgag 
tgccgagcag 
99 a ggccctg 
catgaacacc 
cgaggaatgc 
cagtgtttcc 
tgSfcggcggc 
cagtgggggt 
cagtggccga 
atttgtctcc 
tcactgcctt 
caagccatgt 
ttctttggtt 
attctgcttc 
caaagttttc 



aaagggggca 

tctctccagc 

accatgtctc 

gcctctgcca 

ggtggcggtg 

tatggcagcc 

ggaggcagct 

ggtgccggta 

ggagctggct 

atccaagagg 

agcatccaga 

gcctccttca 

tggaccctgc 

gagcagtaca 

ctggactcag 

gatgaaatca 

gatgctgcct 

attaacttca 

gacacctcag 

gctgaggtca 

tggtatcaga 

cgcaacacca 

attgacaatg 

cgtggggagc 

cagaaggcca 

aagctggccc 

agactcagtg 

tctggatatg 

ctcggtggag 

gtcggcctag 

gggctggggg 

accacctcct 

ccaagtgcag 

tttatccttt 

cccaggagag 

aaatcagcct 

ccaaatctaa 



tcaccgttcc 

acctcccaac 

gccagtcaag 

tcaccccgtc 

gtggtggctt 

ggagcctcta 

tcaggaaccg 

gtggatttgg 

ttggaggtgg 

tcactgtcaa 

gggtgaggac 

tcgacaaggt 

tgcaggagca 

tcaacaacct 

agctgagaaa 

acaagcgtac 

acatgaacaa 

tgaagatgtt 

tggtcctctc 

aggcccagta 

ccaagtatga 

agcatgagat 

tcaagaaaca 

tggccctcaa 

agcaggacat 

tggacgtgga 

gagaaggagt 

gcagtggcag 

gtcttgccgg 

gtggtgggct 

tgggctttgg 

cctcccggaa 

caacccagcc 

tctggagagt 

ccccattccc 

tcaggtttcc 

atcatcaaaa 



• 60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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cagaatcccc accccaatcc caaattttgt tttggttcta actacctcca gaatgtgttc 2280 
aataaaatgc ttttataata t 2301 
<210> 380 

<211> 2255 

<212> DNA 

<213> Homo sapiens 



<400> 380 

ccttcctccc 

tgcaagtcgg 

ggcagctcat 

agcctctaca 

ggaggctatg 

gtggccctgg 

gtcaatgaga 

cgtgcccagg 

aaggtgcggt 

cagctggacc 

aacctgcgga 

aggaatgtgc 

cggacagcag 

aataaggtgg 

tgtctctttg 

ctgtccatgg 

cagtatgagg 

ttccaagagc 

gaaatctcgg 

aagcaggctt 

ctgaaggatg 

gagctggcgc 

atggagatcg 

tttccctccc 

ttccggccca 

gtgtgcagcg 

accctaggga 

ctgccccggg 

agcccactct 

cttggccacc 

agcttggtga 

tacaccacca 

tcgctaacta 

ctagcatagt 

ggacaaacaa 

agtgggacca 

aaagcaggca 

taaacagcac 

<210> 381 



tcctgcatct 

gagctgccgc 

cctccttccg 

gcctgggggg 

gatttggccg 

ggcctgtgtg 

gcctcctggc 

agcgagagca 

tcctggagca 

tgaacaactg 

agcagctgga 

gggacgtagt 

cagagaacga 

aactgcaggc 

aagccgagat 

acaacaaccg 

agattgcctt 

ttcagctggc 

agctcactcg 

ccaacctgga 

cccgggccaa 

ggatgctgcg 

ccacctatcg 

ctgtcagcat 

gcatggtcag 

tgagaggcgg 

agggttccag 

ccccgcctca 

cagctccatc 

cacttctccc 

aattcctccc 

gcccaggtcc 

ctctgctggg 

tcctgcctat 

ccaagagtgg 

agtgcaattc 

ctgttcactt 

aatctcaaaa 



gagctttgtc 

caaggggggc 

ggcagggagc 

tgtccggagc 

gggccgggcc 

cccaactgta 

ccccctcaac 

gatcaaggct 

gcagaaccag 

caagaacaac 

gacgctgtct 

ggaggactac 

gtttgtgctg 

caaggtggaa 

cactcagatc 

gaacctagac 

gaagagtaag 

agctggcagg 

gctcatccag 

gacagccatc 

gctggacgag 

cgagtaccag 

caagctactg 

ctccatcatc 

cggtggctat 

ggagggcagg 

cctgagtgca 

ttccatgacc 

tccaccctgc 

agcctgtgtc 

tgtacacacc 

tggctgccag 

ctccctgggt 

gccaggagct 

cccagtcccc 

agggcctttt 

ttcctttatt 

aaaaaaaaaa 



tccaccagca 
ttcagtggct 
aaagggctca 
ctcaatgtgg 
agtggctttg 
tgcccacctg 
gtggagctgg 
ctgaacaaca 
gtactggaga 
ctggagccca 
ggggacaggg 
aagaagaggt 
ctcaagaagg 
tccatggacc 
cagtcccaca 
ctggacagca 
gccgaggctg 
catggggacg 
agaatccgct 
gctgatgctg 
ctggagggcg 
gage teat ga 
gagagegagg 
agcagcacca 
gtggccaaca 
ageeggggea 
ccctccaaga 
cggctctgga 
tggtcctget 
ttcctgatcc 
ctattaactc 
ctttcctcct 
ctctgcccaa 
ggctctgtgt 
acccccacat 
tctccactca 
ettaatggee 
aaaaa 



acatgagecg 
gctcagctgt 

gtgggggctt 

ccagtggcag 
ctggaagcat 
gaggcatcca 
accccgagat 
agttcgcctc 
ccaagtggga 
tcctcgaggg 
tgaggctgga 
atgaggagga 
atgtggatgc 
aggagatcaa 
tcagtgacat 
tcattgacga 
aggccctgta 
acctcaaaaa 
cagagatcga 
ageagegggg 
ccctgcacca 
gectgaaget 
agtgcaggat 
gtggcggcag 
gcagcaactg 
gtgecaaega 
aaaccagtcg 
tcccacactg 
cccatacacc 
tgggaaggee 
cttggctgtg 
ctgcccggcc 
ggccccgcac 
ttaagaaaag 
ctagctcagt 
cctgcaccca 
ttcctctgtt 



ccaattcacc 
gctctcaggg 
tggcagcegg 
egggaagagt 
gtttggcagt 
ccaggttacc 
ccagaaagtg 
cttcatcgac 
getgetgeag 
ctacatcagc 
cteggagctg 
aatcaacaag 
tgcttacgcc 
gttcttcagg 
gtctgtcatc 
agtccgcacc 
ccagaccaag 
caccaagaat 
gaacgtgaag 
agacaacgcc 
ggecaaggag 
ggccctggac 
gtcaggagaa 
tgtctatggc 
catctctgga 
ttacaaagac 
gtagagaaga 
tacttcccac 
tggcactggc 
tggatgacca 
gtcccccagc 
tetagegcag 
acactgggac 
gaggactgaa 
ctcaaatctg 
gaagcagaga 
gcaacctcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540^ 

60(9 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860. 
192 
198C 
2040 
2100 
2160 
2220 
2255 



<211> 2856 

<212> DNA 

<213> Homo sapiens 



<400> 381 

ccttggagac tgettttetc cagctctgtc aactcaacct ttcccaccat gagteggcaa 60 

ctgaacatca agtccagtgg tgacaagggc aacttcagtg tgeattegge agtggtgcca 120 

aggaaggctg tgggtagect ggcttcttac tgtgcagctg gcagaggggc tggcgctggc 180 

tttggcagtc ggagectcta tagccttgga gggaategge gtatttcttt caatgtggct 240 



BHC03 1 051 



203- 



ggtggcggcg 
cgggccagtg 
tctgtgtgcc 
ctcaacgtgg 
aaggtgctga 
aaccaggttc 
aagaacctgg 
ctgtctgggg 
gactataaga 
gtggtgctta 
gtggactcac 
cagatccaga 
ctggaccttg 
agcaaggccg 
agtcggcatg 
atccagagga 
gccatcgctg 
gatgagctgg 
taccaggagc 
ctgctggagg 
gtcatcagca 
gccagcgctg 
gcgcgagggg 
aggaaagcca 
ctttgccacc 
gtctgtagta 
acacagtcac 
cgctgccttt 
aaggaagctg 
ggtttgttct 
gcatggtggg 
gctccatcag 
gtagaccacc 
gccagcctgt 
agcacctact 
aaagaagatg 
aactggcccc 
gctttttctg 
tttgatgagg 
ttccacagtc 
tgagctgagc 
gatgtcgtgg 
ttgaattaaa 
aaaaaaaaaa 
<210> 382 



ttcgggctgg 
gctttgctgg 
cacctggggg 
agctggaccc 
acgacaagtt 
tagaaaccaa 
agcccatcct 
acagggtgag 
agagatatga 
agaaggatgc 
tggacaaaga 
ctcacgccag 
acagcatcat 
aggccgaggc 
gtgacgacct 
tccggtgtga 
acgctgagca 
agggcgccct 
tcatgagcct 
gcgaggagtg 
gtagcagcta 
tggcaggcag 
gagacctcaa 
cccgctagac 
cccaaaggtg 
cactcaactg 
cttccctgtt 
ggctggcagg 
tgatcatccc 
gtgcttaggc 
gtgacacgag 
aaacaacttt 
agctgcaccc 
gtcacccacc 
agatgccaga 
caatgagcat 
agtcacccac 
tggtgtgtca 
gcagcgtaat 
tctctggggc 
tcagtgagtg 
ctcagggtgt 
gactctgaat 
aaaaaaaaaa 



aggttacggc 
cagtatgttt 
catccaccag 
tgagatccag 
cgcctccttc 
gtgggagctg 
tgagggctac 
gctggactcg 
ggtggagatt 
agatgcagcc 
catcaagttc 
tgagacctct 
cgctgaggtc 
cctgtaccag 
gaaacacacc 
gatcgggaat 
gcggggagac 
gcaccaggcc 
gaaactggcc 
caggatgtct 
cagctaccac 
ctctggcagc 
ggacacccag 
ccatggcctc 
tctgccacac 
tctgcatcgt 
tccttaggtc 
aggctttgga 
tagaaagagg 
caagttagct 
acctgtaagc 
gcaggatgac 
atctccacaa 
cacctccttc 
caccctacac 
ccttggaaca 
ctatgtgcca 
ctgccttttt 
tacattgcct 
tgtgtcctac 
gagctggctt 
aaatattcct 
ttctgctgaa 
aaaaaaaaaa 



ttcaggcctg 
ggcagtgtgg 
gtcactgtca 
aaggtgcgcg 
attgacaagg 
ctgcagcagc 
atcagcaacc 
gagctgagaa 
aaccggcgca 
tacgcagtca 
ctcaagtgtc 
gtcatcctgt 
cgcatgcatt 
accaagatcc 
aggagcgaga 
gtgaagaagc 
aatgccctga 
aaggaggagc 
ctggacatgg 
ggtgagaatc 
caccccagct 
acccagagcg 
ggcaagagca 
acccacctca 
ccaagttccc 
ggtttagctt 
ccatgtggac 
ggcacaagcc 
gaaggagcag 
actgtctgag 
aggtggtgag 
gccaatattt 
agccaagtga 
attatcagca 
atattgcctc 
gactgggagc 
gagtcaggcc 
aa caaaaggg 
tttgtcagga 
atagcttctg 
ttctgtcgct 
aagaatggca 
aaaaaaaaaa 
aaaaaa 



gctctgggta 
ccctggggcc 
acaagagcct 
cccaggagcg 
tacgcttcct 
tggacctgaa 
tgcggaagca 
gcatgaggga 
caacagcaga 
aggtggagct 
tgtatgatgc 
ccatggacaa 
atgaggagat 
aggagctgca 
tggtggagct 
agcgtgccag 
aggatgccca 
tggcgcggat 
agattgccac 
catcctctgt 
ctgcgggtgt 
ggcagaccaa 
ccccagccag 
gcacttggaa 
aggccctgag 
ttactttcaa 
taacgacttc 
attataacct 
gagacgacag 
aggttttaca 
aagttagaca 
aggacaccac 
aggtgtattg 
atagctacca 
ctattctcat 
ctgatcccaa 
tggccgacac 
ctgtcaaagt 
ctgtgggtat 
actctcaatt 
gagtggtact 
gcatctattt 
aaaaaaaaaa 



tggagggggc 
tgcatgtttg 
cttggccccc 
ggaacagatc 
agagcagcag 
caactgcaag 
gctggagaca 
tctggtggag 
gaatgagttt 
tcaggccaaa 
agagatcgct 
caaccgggac 
cgccctgaag 
gctggcagcc 
gaaccggctc 
cctggagacg 
ggccaagctg 
gctgcgcgag 
ctaccgcaag 
gagcatctct 
tgaccttggg 
gaccacagag 
catcccagca 
gaagaggtga 
ttttaaaact 
gctctgattg 
tcattttcct 
tcttggccct 
gggaggggct 
tcccctgcca 
gcctttactt 
gtgtacatct 
gggattctct 
tcatttgtca 
ccattcctgc 
gaggctgctg 
aaggatctat 
caccattctt 
gtgatttgtc 
tttggtgcca 
gcctagtcca 
cctcttttgt 
aaaaaaaaaa 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
'900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
, 2640 
2700 
2760 
2820 
2856 



<211> 1985 

<212> DNA 

<213> Homo sapiens 



<400> 382 



gcggctgttt gcgccccttc tgcctcccgc gcggccagca ccggtactta coaarra^ 

Ei =5 55SB5 BBSS 2SS5 I i 

cwcta™! "I™ 9a9C < * cte = t S«' «999ggcag SgaaII?tg gcS?ca2g 2« 

23SS 5ESS &25EI llkdi I 

Z33S& tSSSSS SSS5 32KK iSF f~S " s 

ac= t gcagaa 3 cagc t g 9 I g algctltctg ££££ ££SE SS£C St 
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ggaacatgca 

gcacagctgc 

ataaggttga 

gcctttatga 

tgtcaatgga 

agtacgagga 

tccaggagct 

aaatctctga 

agcagtgtgc 

tgaaagatgc 

agctggcacg 

tggagatcgc 

atccaaattc 

gcttcagcat 

tcaagaccaa 

gcagctgtgc 

aggcttccca 

aaatgctttg 

gtgatgcttc 

cgtttcgttg 

ggccagctac 

tcttcatctt 

atctg 

<210> 383 



ggatttggtg 
tgagaatgag 
gctccaggcc 
aggggagatc 
caacaaccgg 
gattgcccta 
gcaggtcaca 
gctcaaccgc 
cgatctggag 
ccgggccaag 
gatgctgcgt 
cacctaccgc 
tgtgagcatc 
gggctttggc 
aggcagctgt 
caccaaaaag 
actcacccgt 
tctgtcacta 
gatgcaagga 
ggtcattgcc 
aagcccctct 
ctgattggga 



gaggactaca 
tttgtggtgc 
aaggtggact 
actcagatcc 
gatctggacc 
aagagcaagg 
gcaggccagc 
ctgatccaga 
acggccatcg 
ctggatgagc 
gagtaccagg 
aagctgctgg 
tccgtcatca 
gcctcaagca 
ggcagtgagc 
gcctccagat 
ctcctccctc 
cagtcccaac 
gtccagtcag 
tgaaatgcaa 
gcccccatgc 
ttgtgtccag 



agaagaggta 
tcaagaagga 
ccttgacaga 
agtcccacat 
tggacagcat 
ccgaggctga 
atggggatga 
ggatccgctc 
ccgacgctga 
tggagggcgc 
agctcgtgag 
agagcgagga 
gcagcaccaa 
gttatagcta 
tcaaggatcc 
gatggacaag 
tccttccctg 
cccatttcct 
gagctctctc 
agacaacact 
ctttcttggg 
tcctctgctt 



tgaggtggag 
cgtggatgct 
tgagattaaa 
cagcgacacg 
cattgccgag 
gaccctgtac 
cctcaagctc 
agagataggg 
acagcggggg 
cctgcaccag 
cctgaagctg 
gtgcaggatg 
tgctggggca 
caaaactgca 
ccttgccaaa 
tggttggctc 
ggctcccttt 
ctccctgttg 
agtggcctcc 
tacccaggat 
ccttcactac 
cttctgcaat 



attaacagac 
gcttacatga 
ttcttcaagt 
tccatcgtcc 
gtccgtgccc 
cagaccaaga 
accaaggctg 
aatgtgaaga 
gactgcgccc 
gccaaggagg 
gccctggata 
tctggcgaat 
ggaggggctg 
gctgcagacg 
acctcgggga 
tgtgagcaga 
ccaagtcaag 
tcctcaaggt 
ttcttggtcc 
gtttccccaa 
caagggactc 
aaatgattaa 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
1985 



<211> 2323 

<212> DNA 

<213> Homo sapiens 



<400> 383 

tcagcatctt 

cgggagctgc 

catcctccta 

acagcctggg 

gaggctatgg 

tggccttggg 

tcaacaagag 

gtgcccagga 

aggtgcggtt 

agctggacct 

acctgcggaa 

ggagcgtgcg 

gcacaactgc 

gcaaag tgga 

gtctgtacga 

tgtccatgga 

agtatgagga 

tccaggagct 

agatctcaga 

agcagtgtgc 

tcaaggatgc 

agctggcacg 

ttgagatcgc 

ataccaactc 

gggctggctt 

ggggctacag 

gggaagccag 

tagctctaag 

cagtagtctg 

atttgtcctc 

ctctctgcct 

tgcctccatc 



atccccactt 

tgccaagggg 

ccgagcaggg 

gggtgcccgg 

atttggccgg 

gtccgtgtgt 

cctcctggca 

gcgggagcag 

cctggagcag 

gaacaactgc 

gcagctggag 

cgaagtggtg 

tgagaatgaa 

gctgcaggcc 

gggggagact 

caacaaccgg 

gatcgcccgg 

gcagctagca 

gctgacccgt 

caacctggag 

cagggccaag 

gatgctgcgc 

cacctaccgc 

cgtgagcatt 

tggattcagc 

catgctgcct 

gaccaggctg 

ctcacccacc 

ccttgccctg 

atctccttct 

cagtctttgt 

caaagaggcc 



tctggcctcc 

ggcttcagcg 

ggcaaagggc 

agcatctctt 

ggccgggcca 

ccgtcgttgt 

cccctgaacg 

atcaaggtgc 

cagaaccagg 

aagaataacc 

acgctgtctg 

gaggactaca 

tttgtggtgc 

aaggtggatg 

gctcagatcc 

aacctggacc 

aagagcaagg 

gccggccggc 

ctcatccaaa 

acggccatcg 

ctggatgagc 

gagtaccaag 

aagctgctgg 

tcggtcatca 

aatgctggca 

gggggctgtg 

gggagtgcaa 
aaaaaaacca 
gcagactgct 
tgctttgctt 
taatgttgaa 
ttttaactca 



ccaccatgag 

gctgctccgc 

tcagtggagg 

tcaatgtggc 

gtggctttgc 

gcccgcccgg 

tggagctgga 

tgaacaacaa 

tgctggagac 

tggagcccat 

gggacagggt 

agaagaggta 

ttaagaagga 

ccctggatgg 

agtcccacat 

tggacagcat 

ccgaggccga 

atggggatga 

gactgcgctc 

ctgacgccga 

tggagggcgc 

agcttttgag 

agggcgagga 

acagctccat 

cctacggcta 

tcactggcag 

gtgaattcag 

tgagataaaa 

ccggactctt 

tgctgtggac 

tcaggatttg 

gcctcccctt 



ccgccaattc 

tgtgctctca 

cttcagcagt 

cagtggcagt 

tggcagcatg 

gggtatccat 

ccctgaaatc 

gttcgcctcc 

caagtgggag 

ccttgagggc 

gaggctggac 

tgaagaagaa 

cgtggacgca 

agaaatcaag 

cagcgacacg 

cattgctgag 

ggccctgtac 

cctgaaacac 

ggagattgag 

gcagcggggg 

cctgcagcag 

cgtgaagctg 

gtgcaggatg 

ggccgggatg 

ctggcccagc 

tgggaactgt 

ggactcccag 

gtgcaacaca 

ctaggaaatt 

ctctcctcct 

agagctggag 

ctggaagaag 



acctacaagt 

gggggcagct 

cggagccttt 

gggtgggcag 

tttggcagtg 

caggtcacca 

cagaaagtgc 

ttcattgaca 

ctgctacagc 

tacatcagca 

tcggagctga 

ataaacaagc 

gcttacacga 

ttcttcaagt 

tccatcatcc 

gtccgtgccc 

cagaccaagt 

accaaaaatg 

agtgtgaaga 

gactgtgccc 

gccaaggagg 

tccctggata 

tccggagaat 

gcaggcacag 

tctgtcagcg 

agcccccgtg 

ggaaagacct 

gcccatttcc 

cctctgtctc 

tctccttgac 

cccatctttc 

acttggcctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
66CJ 
720^ 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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ttctctgcct 
cctgtgctgc 
acaagttttc 
ctcaggcaaa 
aggagtaccc 
acttgtcctt 
cccaaaaaaa 
<210> 384 



gcatgtccct 
tctttcatac 
agatctactg 
acacctggtg 
cttgaggcta 
gtttaggcct 
aaaaaaaaaa 



gtgcagagga 
cctgtctcag 
atagacctgc 
ggaagttgta 
attctaccaa 
cacctccctg 
aaaaaaaaaa 



aggctgtctc 
taactatttg 
aggaactctc 
tcttgtatag 
ccaagggatt 
ctcctgtgtc 
aaaaaaaaaa 



tcaagggtaa 
gccatgtgct 
tcatgttaga 
ccactgcata 
tgtgctcttc 
ttaccaataa 
aaa 



ccccaatctg 
ccctgggtac 
gcaatactgg 
ttacatagcc 
atctgtgggt 
acttataaag 



<211> 2427 

<212> DNA 

<213> Homo sapiens 



1980 
2040 
2100 
2160 
2220 
2280 
2323 



<400> 384 
agcctgtgac 
cgaggaagag 
tctggtggaa 
ggcgggggct 
aagagcatct 
ggtggcagag 
ggtggcagcg 

ggaggttttg 

ggaggatacc 
aacgtgaaag 
actctcaaca 
caggtgttac 
aacctggagc 
actgcagaaa 
tataagaaga 
acgcttaaaa 
gacctgctga 
atacatcaga 
gacttggata 
aaggaagaag 
agacatggag 
cagaggctgc 
atcgcagatg 
gacctggagg 
caggagctga 
ctggagggcg 
acaagcagca 
agagggtcca 
cgacagtctg 
ggctctggcg 
atctctggag 
ggaggctcca 
ggtagctcag 
cccaccacca 
ctgtacctct 
ggaaaagaat 
ccagtgacgc 
atgcccattc 
gttgaggacc 
tgtgtttctg 
gggtcttcat 

<210> 385 



tttcctccct 
gaggaggtgg 
gccggagatc 
tcggtggagg 
ccattagtgt 
gaggtggttt 
gcttcagtgg 

ggggccctgg 
ctggtggcat 

ttgacccaga 
acaaatttgc 
agaccaaatg 
ccatcttcca 
gaacatcaca 
agtatgagga 
aggacgtgga 
accaggaaat 
gtgtcactga 
gcatcatcgc 
cggaggccct 
acagcctgaa 
agggggagat 
ccgagcagcg 
aggccctgca 
tgaacgtgaa 
aggagtgcag 
ccatttcatc 
gttccggagg 
gctccagagg 
gtggttctgg 
gaggatatgg 
gctctggagg 
gtgaagcttt 
ccgactctcc 
aactggcagc 
gtccacatac 
ctctgtgttc 
ccgcccccac 
caggcccatc 
tgtatatagc 
aataaacctc 



ggacaaaggc 
aggaggattc 
aacttccagc 
cggctttggc 
ggctggagga 
tggaggcggc 
tggtggtttc 
tggtgttgga 
ccacgaagtc 
gatccagaat 
ctccttcatt 
ggagctgcta 

ggggtatatc 

gaattcagag 
tgaaatcaat 
caatgcctac 
tgagtttctg 
caccaacgtc 
cgaggtcaag 
gtaccacagc 
agagatcaag 
cgcacatgtg 

tggggagcat 

gcaggccaag 
gctggcccta 
gatgtctgga 
aaatgtggca 
aggatacagc 
cggtagtgga 
aggaagatac 
ctctggaggt 
aggatatggc 
tggttccagc 
caacccagac 
aagttaaatt 
agtttttgaa 
tggggcggaa 
catgcctctt 
cagggatttc 
tcctcttgtc 
atttgca 



atcatgagtt 

°ggggcttca 

ttctcctgct 
agtcggagtc 
ggtggtggct 
agcggctttg 
ggtggaggcg 
ggtttagggg 
tctgtcaacc 
gtgaaggccc 
gacaaggtgc 
caacaaatga 
gacagcctca 
ctgaataaca 
aagcgcacag 
atgataaagg 
aaagttctct 
atcctctcca 
gcccagtatg 
aagtatgagg 
atagagatca 
aagaagcagt 
gccctcaagg 
gaggacttgg 
gatgtggaga 
gacctcagca 
tccaaggctg 
tctggaagca 
ggaggaggtt 
ggatctggtg 
ggaaaacaca 
tctggaggtg 
gtgaccttct 
tctcccactc 
tttgtcattt 
agatcttctc 
tctgtgctgt 
tgcattgccc 
atctctaagc 
cactctgctt 



gtcagatctc 
gcagcggctc 
tgagccgcca 
ttgttggcct 
ttggcgccgc 
gaggcggcag 
gctttggtgg 
gtcctggtgg 
agagcctcct 
aagagcgtga 
ggttcttgga 
atgttggcac 
agagatatct 
tgcaggatct 
ctgctgagaa 
tggagttgca 
atgatgcgga 
tggacaacag 
aggagatcgc 
agctccaggt 
gcgagctgaa 
gtaagaatgt 
atgccaggaa 
cgcggctgct 
tcgccaccta 
gcaatgtgac 
cctttggagg 
gcagttatgg 
ctatctctgg 
gtggctctaa 
gctctggagg 

ggggttctag 

cttttagata 
cagaatgtag 
atctctgatg 
tccaaaccag 
ctaggtttgt 
attttccaga 
ctggcagtgc 
tcggaagtgc 



ttgcaaatct 

agctgtggtg 

tggtggtggt 

tggagggacc 

tggtggattt 

cggctttgga 

aggccgcttt 

ctttgggcct 

gcagcctctc 

gcagatcaaa 

gcagcagaac 

ccgccccatc 

ggatgggctc 

tgtggaggat 

tgattttgtg 

gtccaaggtg 

gatatcccag 

ccgcaacctg 

ccagaggagc 

gactgtcggg 

ccgcgtgatc 

gcaagatgcc 

caagttgaat 

gcgtgactac 

ccgcaaactg 

tgtgtctgtg 

ttctggaggt 

ctctggaggc 

aggaggatat 

gggagggtcc 

tggctctaga 

ctctgtaaag 

aagatgagcc 

aagcctgtct 

gcactttgag 

ttagttagag 

gcttctagcc 

tgtgtattct 

tggggggaaa 

tgtggtctgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
;900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2427 



<211> 2507 



<212> DNA 



<213> Homo sapiens 



BHC03 1 051 



-206- 



<400> 385 

agctccttcc 

ttagttccag 

tcaactacca 

gattttcaag 

ggagtcttgt 

gacgtggtag 

gtggtggtgg 

gtggtggtgg 

ctgtctgccc 

tcaatgtgga 

agtcactcaa 

accaggtact 

ataatttaga 

tgaagagtga 

attaccggaa 

tgaccatcaa 

ttgacaactt 

agatgcagac 

tcgacctgga 

gcaaagctga 

gcagacatgg 

tccagagact 

ccatcagtga 

atgacctgga 

accaggagct 

tcctggaggg 

tgagcacaag 

acggctctgg 

gcggcggcgg 

gctatggctc 

ggggctacag 

gcggcgggag 

cctctggtgg 

aaagagatgc 

ttggcaagac 

cactagctca 

gatcagtggc 

gttcagcagt 

aggaccagtt 

gttttgtaca 

atttccttgt 

tttacggctc 

<210> 386 



cttactctac 

gtctgggtac 

gcgcaggacc 

ctgtggtggt 

taaccttggt 

tggctttggt 

ctttggtgga 

ttttggtgga 

tcctggtggc 

gattgaccct 

caaccaattt 

gcaaacaaaa 

gccctacttt 

tcaatctcgg 

caagtatgag 

gaaggatgtg 

gcagcaggaa 

tcaaatcagt 

cagcatcatt 

ggccgagtcc 

ggatagtgtg 

tagatctgaa 

tgcagagcag 

ggatgccctg 

gatgaacacc 

agaagaaagc 

ccacaccacc 

aggtagcagc 

ccgtggcagc 

tggaggtggc 

aggtggctct 

ctctggaggc 

cagttccagc 

cctctgtttc 

cgaggtcgat 

tcctattccc 

aatctcagtc 

gaccaccacc 

agacccaggg 

taaggtgtaa 

tttgctgcaa 

tgtctaacct 



cttgctccta 
cgaagtggag 
accagcagct 

ggtggtggta 

ggcagtaaaa 

ggtggttatg 

ggtggcattg 

ggtggctttg 

atacaagaag 

gagatccaaa 

gcctccttca 

tgggagctgc 

gagtcattca 

ttggattcgg 

gatgaaatca 

gatggtgctt 

attgatttcc 

gaaactaatg 

gctgaggtca 

ttgtaccaga 

agaaattcaa 

atcgacaatg 

cgtggcgaga 

cagcaggcca 

aagctggccc 

aggatgtctg 

atcagtggag 

tatggctccg 

tatggctccg 

ggcggcggcc 

ggaggcggcg 
tccataggag 
gtgaggtttg 
attagctcta 
ttgtcccagc 
ccagctcttt 
ccctggctat 
cacatgacat 
tgtggacagc 
gcaagttgtt 
taaactgcat 
ggatgggtgt 



cttttctcta 

ggggcttcag 

ccacacgccg 

gctttggtgc 

gcatctccat 

gtggtggtgg 

ggggtggtgg 

ggggtggtgg 

tcactatcaa 

aggtgaagtc 

ttgacaaggt 

tgcagcaggt 

tcaacaatct 

aactgaagaa 

acaagcggac 

atatgaccaa 

ttacagcact 

tcatcctctc 

aggcccagta 

gcaagtatga 

agatagaaat 

tcaagaagca 

atgccctcaa 

aggaagacct 

tggatctgga 

gagaatgtgc 

gtggcagccg 

gaggtggtag 

gaggtagcag 

atggcagcta 

gcggcggcag 

gccggggatc 

tttctaccac 

gttctccccc 

cttaccggag 

cttttctgct 

gaccctgctt 

ttcaaagcac 

tccttagcat 

tttcttttgt 

ttgaaattct 

tttgtgaggt 



agtcaacatg 

ctctggctct 

cagtggagga 

tggtggtgga 

aagtgtggct 

ctttggtggt 

ctttggtggt 

atatgggggt 

ccagagcctt 

tcgagaaagg 

gaggttcctg 

agatacctcc 

ccgaaggaga 

catgcaggac 

aaatgcagag 

ggtggacctt 

ctaccaagca 

tatggacaac 

cgaggatata 

agagctgcag 

ttctgagctg 

gatctccaac 

ggatgccaag 

ggcccgcctg 

gattgccacc 

cccgaacgtg 

aggaggtggc 

ctatggttct 

ctacggctcc 

cggctccgga 

ctctggcggc 

cagctctggg 

ttattccgga 

agcatcacta 

aaaagagcta 

gtttcccaat 

tgttctttcc 

ctccttaagc 

cttatctctg 

ggagaggtct 

ccatgtctcg 

aaaagaa 



agtcgacagt 60 

gctgggatca 120 

ggtggtggga 180 

tttggaagtc 24 0 

agaggaggtg 300 

ggtggctttg 360 

tttggcagtg 420 

ggttatggtc 480 

cttcagcccc 540 

gagcaaatca 600 

gagcagcaga 660 

actagaaccc 720 

gtggaccaac 780 

atggtggagg 840 

aatgaatttg 900 

caggccaaac 960 

gagttgtctc 1020 

aaccgcagtc 1080^ 

gcccagaaga 1140^ 

atcactgctg 1200 

aatcgtgtga 1260 

ttgcagcagt 1320 

aacaagctga - 1380 

ctgcgcgact 144 0 

tacaggaccc 1500 

agtgtgtctg 1560 

ggcggtggct 162 0 

ggaggtggcg 1680 

ggaggtggca 1740 

agcagcagt g 1800 

cggggctctg 1860 

ggtgtcaagt 192 0 

gtaaccagat 1980 

acaaatatgc 2040 

tggttagtta 2100 

gaagttttca 2160 

ctgagaaaca 2220 

cagccagagt 2280 

tgctgttttg 2340 

taaactcccc 2400 

atcgcccttg 246« 
250-r 



<211> 2224 



<212> DNA 

<213> Homo sapiens 



<400> 386 
atgagcagac 
gctgtggtct 
ggggcctatg 
ggcggcaaca 
gggggagggc 
tatggaggtg 
ggagctggtg 
ggtggctttg 
tttgcgcctg 



aagccagcaa 
ccggcagcag 
gcttccggag 
agagcatctc 
ggagcagctg 
gctttggtgg 
gctttggagg 
gtgggtctgg 
ggggctttcc 



gacatctggt 
caggatgagc 
cggagcaggt 
catcagcgtg 
tgcctttgca 
tggctttggt 
ggctggtggc 
tggctttggt 
tgggggaatt 



ggcgggagcc 
tgtgtggccc 
ggctttggca 
gcagctggcg 
ggtggctatg 
ggtggcagag 
tttggagggg 
gggcctggca 
caggaagtga 



agggtttctc 
actctggggg 
gtcgcagcct 
gctcccgggc 
gaggtggctt 
gaatgggagg 
ctggtggctt 
gcttgggcag 
ctactaacca 



cgggcgctct 
agctggcgga 
ctacaacctg 
tggaggcttt 
tgggagcggc 
tggctttggt 
tggtggtcct 
tcctggtggc 
gagtctcctg 



60 
120 
180 

240 
300 
360 
420 
480 
540 



BHC03 1 051 



207- 



cagcccctca 

cagatcaaga 

caacagaaca 

atctcaggca 

tacct.ggaca 

gacctggtgg 

gagaatgaat 

cttcaggcca 

gctgagctat 

aataatcgtt 

atcgctcaga 

cagaccacgg 

ctcaacagaa 

aacctgcaga 

aatgccaagc 

ctgctacgtg 

acctaccgca 

gctgtcagca 

ggaggttacg 

tcaggcattg 

acgagcggtt 

agtggcgggg 

tcccagtcct 

gcgcctcccc 

tccccaagaa 

ccggccaagc 

ccaggtgtgg 

tcactacagc 

taaa 

<210> 387 



aagtggagac 

ccctcaacaa 

aagtcctgga 

caaacaacct 

acatcctcgg 

aagacttcaa 

ttgtgactct 

aagtggatgc 

ctcagatgca 

ccctggacct 

gaagcaaggc 

ctggcaggca 

tgatccagag 

cggccattgc 

tccaagagct 

actaccagga 

agctgctgga 

tctccgtggt 

gcggaggcat 

gctttggccg 

tcagcggtgg 

gcttcagctc 

cccagcgcta 

tccgctcccc 

cgcggctctg 

ccagtcacga 

cctaggcgcc 

tctgatgtat 



tgacccccag 

caagtttgcc 

gaccaagtgg 

tgagcctctt 

ggagagaggg 

gaagaaatat 

gaagaaggat 

cttgatagat 

gagccacatc 

ggacagcatc 

cgaagctgag 

tggggatgac 

gctgcgggca 

gcaggccgag 

gcaggctgct 

gctgatgaat 

gggcgaggag 

cagcagcagc 

gggcggtggt 

99gaggcggc 

cagcggcttt 

ggccagcaac 

ctccagataa 

acgcccccat 

cgctggaagg 

gaatttcagg 

accttctgct 

atagcacaca 



attgggcaag 

tccttcattg 

aacctgctcc 

tttgagaatc 

cgcctggact 

gaggatgaaa 

gtggacagtg 

gagatcgact 

agtgacacat 

attgctgaag 

gccctgtacc 

ctaagaaata 

gagatcgagg 

cagcatggag 

ctacagcagg 

gtcaagctgg 

tacagcagga 

acgacttccg 

ttaggaggtg 

ggtggaatcg 

ggctccatct 

cggggcggca 

agagcacgca 

attggcaaca 

ctgtggcctc 

ctcttctctc 

gccccggagt 

gccccctccg 



taaaggccca 

acaaggtgcg 

agcagcaggg 

acatcaacta 

ctgagctgaa 

tcaataaacg 

cctatatgaa 

tcttaaggac 

ctgtggtgct 

ttggtgcaca 

agaccaagtt 

ccaagagcga 

gtgtcaagaa 

agatggccct 

cgaaggatga 

ccctggacgt 

tgtctggaga 

cctccgcagg 

gcttcagtgc 

gcggtggatt 

ctggcgcccg 

gcatcaagtt 

tcagcatcgc 

cccccagcgc 

ttgctcctcc 

cggctccagc 

cccctccctg 

cccctccggt 



ggagcgggaa 

gttcctggag 

cacaagttcc 

cctgcggagc 

gaacatggag 

tacagctgct 

caaggtggag 

cctctacgac 

gtccatggac 

gtatgaggat 

gggggagctg 

gatcatagag 

gcagaatgcc 

caaggatgcc 

cctggcgcgg 

ggagatcgcc 

gtgtccgagt 

tggctatgga 

gggcggcggc 

tggcggcggc 

ctatggagtg 

ctcccagtcc 

agccactcca 

caccccatgc 

ttccagatcc 

ctgtgattgt 

ggtgggatgc 

ctgtcgccaa 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2224 



<211> 2617 



<212> DNA 

<213> Homo sapiens 



<400> 387 
gacttgctcc 
taactccaca 
ggtgctgtga 
ggcaggggct 
gtgatgggat 
gctcaacggg 
tttttccctt 
gctgagggct 
gctctctgca 
ctgtggctcg 
tggaggtgct 
ggggaacaaa 
tgctggaggc 
tggtggcttt 
cgtctgcccc 
ccacgtggag 
gctcctcaac 
taaggtcctg 
aaaccttgag 
gggcaatgac 
cttcaagact 
ggtcctaaag 
ggacagtctt 
gatgcagacc 
ggacctggac 
caaggctgag 



ggtttgcaga 
ttctgtgggg 
gcctagcgag 
tctgattagg 
ccagtaacct 
tgcgggaagg 
tatgcccctg 
ccagcactca 
gccatgattg 
gccattgtag 
ggccgatgct 
agcatctcca 
tttggcactg 

ggtggtggat 

gctgggggaa 
attgaccctg 
aacaagtttg 
gagaccaaat 
cccctctttg 
aaagggcgcc 
aagtatgaag 
aaggacgtgg 
aatgacgaga 
catgtcagcg 
agcattattg 
gctgaagccc 



gctaggaggt 
tgagaggatg 
atggaggtac 
aagccctggg 
gtgccctcca 
atgggcttgt 
caaccaagcc 
ccggcctggg 
ccagacagca 
gcggtggcaa 
cttctggggg 
tgagtgtggc 
gtggctttgg 
ttgggggctc 
ttcaggaggt 
agatccagaa 
cctccttcat 
ggaacctgct 
agacctacct 
tgcagtctga 
aggagatcaa 
atgctgccta 
tcaacttcct 
acacgtccgt 
ccgaggtccg 
tgtaccagac 



ggcaggctgt 

ggtgatgggg 

agtgggtgtg 

gcaccagttc 

gatgacttct 

gccaagggcc 

ctgctgctcc 

ccctgtcact 

gtgtgtccga 

gagaggtgcc 

atttggcagc 

tgggtcacga 

tgccggcggc 

cttcagtggt 

caccatcaac 

agtccggacg 

cgacaaggtg 

ccagcagcag 

cagtgtcctg 

gctgaagacc 

caaacgcaca 

cctgaacaag 

gaaggtcctc 

ggtcctttcc 

tgcccagtac 

caaggtccag 



gcgctcaaac 
tgtcttttct 
ggcctggagc 
aggttctccc 
gtaggtgtgt 
aagcccagag 
aggacatata 
tctctgatag 
ggcgggcccc 
ttcagctcag 
agaagcctct 
caaggtgcct 
ttcggagctg 
aagggtggcc 
cagagcttgc 
gaagagcgcg 
cagttcttag 
acgaccacca 
aggaagcagc 
atgcaggaca 
gcagccgaga 
gtggagttgg 
tatgatgcgg 
atggacaaca 
gaggagattg 
cagctccaga 



tcaggctgtc 
ggaggaggga 
gctgggccca 
agagagtagt 
ttagtgacat 
atgtttcaga 
agagacgaag 
ctcccagctc 

ggggcttcag 

tctccatgtc 
acaacctcag 
gctttggggg 
gtttcggcac 
ctggcttccc 
tcacccccct 
aacagatcaa 
agcaacagaa 
cctccagcaa 
tagatacctt 
gcgtggagga 
atgactttgt 
aggccaaggt 
agctgtccca 
accgcaacct 
cccagaggag 
tctcggttga 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



BHC 03 1 051 



208- 



ccaacatggt 
ccagaggctg 
cgtggctgat 
agagctggag 
ccaggagctc 
gctggagggc 
ggtcagcggt 
tggcctgagt 
ctctggcagt 
gagacgaggt 
tccttcactt 
atcccatggg 
ttggagtttg 
gtcagctgac 
gaattattgg 
attatccatc 
aagcttagga 
ctgcaggtga 
<210> 388 



gacaacctga 
cgggcagaga 
gcagagcagc 
gctgccctgc 
atgagtgtga 
gaggagtaca 
agcaccagca 
agtggctttg 
tccagcagca 
ccctgcagct 
cacctccatc 
ctctctctgc 
gtagctactt 
ctctcacctc 
ggtcacatat 
tacatcagaa 
tgagaaccac 
atcttcaata 



agaacaccaa 
tcgagaacat 
gaggtgagaa 
agcaggccaa 
agctggcctt 
gaatgtctgg 
ctggaggcat 
gctccggctc 
agatcatctc 
cactgtgtcc 
ctctgtctct 
cccaggatga 
cttgatttgg 
ccatgggcag 
gtcccttccc 
ccaaactact 
ttagtgtccc 
aaatgctttt 



gagtgaaatt 
caagaagcag 
tgcccttaaa 
ggaggagctg 
ggacatcgag 
agaatgccag 
cagcggagga 
tggaagtggc 
taccaccacc 
agctgggccc 
ggggctcatc 
tcttctgtgc 
gcctggtgac 
agaagaaaat 
agtccaatgc 
tctccaacac 
attctactcc 
gtcattc 



gcagagctca 
tgccagactc 
gatgcccaca 
gcacgaatgc 
atcgccacct 
agtgccgtga 
ttaggaagtg 
tttgggtttg 
ctgaacaaga 
agcactggtg 
ttactagtat 
tgggacaggg 
ccacctggaa 
gaccaggagt 
catctcccac 
ccggcagcac 
tctcattccc 



acaggatgat 
ttcaggtatc 
gcaagcgcgt 
tgcgtgagta 
accgcaaact 
gcatctctgt 
gctccgggtt 
gtggcagtgt 
gacgatagag 
tctctgtgct 
cccctccact 
actctgcctc 
tgggaaggat 
gtcatctcca 
tagatcctgt 
ttggccctgc 
tcttatccat 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2617 



<211> 1752 

<212> DNA 

<213> Homo sapiens 



<400> 388 
ctgctccttc 
tgtccatcag 
gcagccgctc 
tgggcagcag 
gaggcatcac 
accccaacat 
agtttgcctc 
ccaagtggag 
agagctacat 
tggaggcgga 
atgagatcaa 
atgaagctta 
tcaacttcct 
acacatctgt 
ctgaggtcaa 
tgtaccagat 
ggcgcacaaa 
ttgagggcct 

agcgggccaa 

agctggccct 
ggctggagtc 
gtggtctgag 
ccagctttgg 
tggttgtgaa 
tgcccaagtg 
cctgggaagg 
gccctagccc 
ccagctacaa 
tctgccaaac 
<210> 389 



taggatctcc 

ggtgacccag 

ctacacgagt 

caactttcgc 

cgcagttacg 

ccaggccgtg 

cttcatagac 

cctcctgcag 

caacaacctt 

gcttggcaac 

taagcgtaca 

catgaacaag 

caggcagcta 

ggtgctgtcc 

ggcacagtac 

caagtatgag 

gactgagatc 

caaaggccag 

ggccattaag 

gcaggacatg 

ggacatcgag 

tgggatgcag 

ctcggcctat 

ctctggcgcg 

gaagatcgag 

aacagctgcg 

aggccgctat 

tcagcccacc 

aacaattcaa 

cc 



gcctggttcg 
aagtcctaca 
gggcccggtt 
ggtggcctgg 
gtcaaccaga 
cgcacccagg 
aaggtacggt 
cagcagaaga 
aggcggcagc 
atgcaggggc 
gagatggaga 
gtagagctgg 
tatgaagagg 
atggacaaca 
gaggatattg 
gagctgcaga 
tctgagatga 
agggcttccc 
gatgcnaacg 
gcgcggcagc 
atcgccacct 
aacatgagta 

gggggcctca 

ggctccagct 
acacgtgatg 
gcagcccctc 
gcagggtagc 
tggggagttt 
ttgctttttt 



gcccgcctgc 

aggtgtccac 

cccgcatcag 

gcggcggcta 

gcctgctgag 

agaaggagca 

tcctggagca 

cggctcgaag 

tggagactct 

tggtggagga 

acgaatttgt 

agtctcgcct 

agatccggga 

gccgctccct 

ccaaccgcag 

gcctggctgg 

accggaacat 

tggaggccgc 

ccaagttgtc 

tgcgtgagta 

acaggaagct 

ttcatacgaa 

caagccccgg 

ccttcagccg 

ggaagctggt 

ccagcctacc 

actgggaaca 

actacctggg 

tttttggtcc 



ctccactcct 

ctctggcccc 

ctcctcgagc 

tggtggggcc 

cccccttgtc 

gatcaagacc 

gcagaacaag 

caacatggac 

gggccaggag 

cttcaagaac 

cctcatcaag 

ggaagggctg 

gctgcagtcc 

ggacatggac 

ccgggctgag 

gaagcacggg 

cagccggctc 

cattgcagat 

cgagctggag 

ccaggagctg 

gctggagggc 

gaccaccagc 

cctcagctac 

caccagctcc 

gtctgagtcc 

cctcctgcgc 

ggagacccac 

gacccccctt 

aaaataaaac 



gcctccacca 

cgggccttca 

ttctcccgag 

agcggcatgg 

ctggaggtgg 

ctcaacaaca 

atgctggaga 

aacatgttcg 

aagctgaagc 

aagtatgagg 

aaggatgtgg 

accgacgaga 

cagatctcgg 

agcatcattg 

gctgagagca 

gatgacctgc 

caggctgaga 

gccgagcagc 

gccgccctgc 

atgaacgtca 

gaggagagcc 

ggctatgcag 

agcctgggct 

tccagggccg 

tctgacgtcc 

tgccccagag 

ctgaggctca 

gcccatgcct 

ctcagctagc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 

1020^ 

1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1752 



<211> 1412 



<212> DNA 



<213> Homo sapiens 



BHC03 1 051 



-209- 



<400> 389 

c 9999tcgtc 

accactcgct 

tacggcgccc 

tcccggatct 

gccaccggga 

atgcaaagcc 

gagaaccgga 

agagactgga 

actgtggaca 

tttagagtca 

gggctccgca 

gaggctctca 

ctacaagccc 

gacctcgcca 

cgagaggagc 

acacagtctg 

cagtccttgg 

ctgagggagg 

caccttgagt 

gaggccctgc 

ctggaagatg 

accatccaaa 

gacaccaaag 

gccaataaaa 

<210> 390 



cgcaaagcct 
ccaccttctc 
ggccggtcag 
ccgtgtcccg 
tagccggggg 
tgaacgaccg 
ggctggagag 
gccattactt 
atgcccgcat 
agtatgagac 
aggtcattga 
aggaggagct 
agattgccag 
agatcatggc 
tagacaagta 
ctgaggttgg 
agatcgacct 
tggaggcccg 
cagagctggc 
tgaacatcaa 
gcgaggactt 
agaccaccac 
ttctgaggca 
agttcagagt 



gagtcctgtc 

caccaactac 

cagcgcggcc 

ctccaccagc 

tctggcagga 

cctggcctct 

caaaatccgg 

caagatcatc 

cgttctgcag 

agagctggcc 

tgacaccaat 

gctcttcatg 

ctctgggttg 

agacatccgg 

ctggtctcag 

agctgctgag 

ggactccatg 

ctacgcccta 

acagacccgg 

ggtcaagctg 

taatcttggt 

ccgccggata 

ttaagccagc 

tcattggatg 



ctttctctct 

cggtccctgg 

agcgtctatg 

ttcaggggcg 

atgggaggca 

tacctggaca 

gagcacttgg 

gaggacctga 

attgacaatg 

atgcgccagt 

atcacacgac 

aagaagaacc 

accgtggagg 

gcccaatatg 

cagattgagg 

acgacgctca 

agaaatctga 

cagatggagc 

gcagagggac 

gaggctgaga 

gatgccttgg 

gtggatggca 

agaagcaggg 

tc 



ccccggacag 

gctctgtcca 

caggcgctgg 

gcatggggtc 

tccagaacga 

gagtgaggag 

agaagaaggg 

gggctcagat 

cccgtcttgc 

ctgtggagaa 

tgcagctgga 

acgaagagga 

tagatgcccc 

acgagctggc 

agagcaccac 

cagagctgag 

aggccagctt 

agctcaacgg 

agcgccaggc 

tcgccaccta 

acagcagcaa 

aagtggtgtc 

taccctttgg 



catgagcttc 
ggcgcccagc 
gggctctggt 
cgggggcctg 
gaaggagacc 
cctggagacc 
accccaggtc 
cttcgcaaat 
tgctgatgac 
cgacatccat 
gacagagatc 
agtaaaaggc 
caaatctcag 
tcggaagaac 

agtggtcacc 

acgtacagtc 
ggagaacagc 
gatcctgctg 
ccaggagtat 
ccgccgcctg 
ctccatgcaa 
tgagaccaat 
ggagcaggag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1412 



<211> 2516 

<212> DNA 

<213> Homo sapiens 



<400> 390 

c ggggccggc 

ggcgaagggc 

gacagtccct 

ggctgactgc 

cggcgagatg 

ccacctggac 

attcctagag 

attcagcgat 

ggaccgcgcc 

ttatgcggcc 

gcgtaagaaa 

cctggtgaac 

cttccagttg 

cgctgacatc 

tgatggccca 

cgaccaagat 

ggcgcacgag 

ggcgccctac 

cgtgcagagc 

gctgttcact 

gcagggccct 

actgtacatt 

gccccgtcca 

gtcattgaat 

accaaccatc 

gggccccaag 

tcctgacgcc 



ggacccgcgg 
agcggaggag 
ctccgtgcgg 
cggctggttc 
gtcgcgcgcg 
gcccagcccg 
aagtacggat 
gccatcagag 
accctgcgcc 
tgggctgaga 
cgctttgcaa 
tggcctgagc 
tggagcaacg 
cggctcacct 

gggggcgccc 

gagcgctggt 
atcggtcaca 
tacaagaggc 
ctgtatggga 
gactttgaga 
aaatactgcc 
tttaaaggga 
ctgcaggaaa 
gatggagatt 
gtgggctcca 
ccagtgtggg 
gccctcttct 



gcaggcactg 

cggcccagag 

gggcggcgca 

cctgcgcgca 

tcggcctcct 

cggagcgcgg 

acctcaatga 

cgtttcagtg 

agatgactcg 

ggatcagtga 

agcaaggtaa 

atctgccgga 

tctcagcgct 

tcttccaagg 

tggcgcacgc 

ccctgagccg 

cgcttggcct 

tgggccgcga 

agcccctagg 

cctgggactc 

actcttcctt 

gccatttctg 

gatgggtcgg 

tctacttctt 

gcttaaagga 

gtctcccaca 

tccctcctct 



cccgggctgg 

cgcgcagcta 

gagcagtccc 

gtagctcccc 

gctgcgcgcc 

aggccaggag 

acaggtcccc 

ggtgtcccag 

tccccgctgc 

cttgtttgct 

caaatggtac 

gccggcagtt 

ggagttctgg 

ggaccacaac 

cttcctgccc 

ccgccgcggg 

cacccactcg 

cgcgctgctc 

gggctcagtg 

ctacagcccc 

cgatgccatc 

ggaggtggca 

gctgcccccc 

caaagtgcaa 

gagcagaggg 

gctgtgccgg 

gcgccgcctc 



acgacgtctg 
gggcactggc 
atccccgggg 
gagccgggct 
ctgcagctgc 
ctgcgcaagg 
aaagctccca 
ctacctgtca 

ggggttacag 

agacaccgga 
aagcagcacc 

c ggggcgccg 

gaggccccag 
gatgggctgg 
cgccgcggcg 
cgcaacctgt 
cccgcgccgc 
agctgggacg 
gccgtccagc 
caaggaaggc 
actgtagaca 
gctgatggca 
aacattgagg 
tccgtttgat 
ggtcgatgct 
gcagggggcc 
atcctcttca 



gccggctccc 60 

gaaaccccgg 120 

tcccgggcgc 180 

gcaccggagg 240 

tactgtgggg 3 00 

aggcggaggc 3 60 

cctccactcg 42 0 

gcggcgtgtt 480 

ataccaacag 540 

ccaaaatgag 600 

tctcctaccg 660 

tgcgcgccgc 72 0 

ccacaggccc 780 

gcaatgcctt 840 

aagcgcactt 900 

tcgtggtgct 960 

gcgcgctcat 1020 

acgtgctggc 10 8 0 

tcccaggaaa 114 0 

gccctgaaac 1200 

ggcaacagca 1260 

acgtctcaga 132 0 

ctgcggcagt 1380 

tgatatgaat 1440 

ggaggttccg 1500 

tgccccgcca 1560 

agggtgcccg 1620 
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ctactacgtg 
ggactgggga 
catcttcttc 
ctcgggccgc 
cgccctgttc 
atgttcccca 
ccgggcagca 
ttgtctttgt 
aaaagggtcc 
gtttcttcgg 
gggatgaaaa 
aggaccccct 
aggaagcatg 
tcaggacaca 
gggctggaat 
<210> 391 



ctggcccgag 
ggcatccctg 
cgagatgacc 
tgggccaccg 
tgaaggcacc 
cccccggggc 
aagcctccat 
ctgtccccac 
cagatccctt 
ctaaaggtac 
actcagcaga 
ggctcagttc 
gatggggatg 
gagtgggagg 
aaagaggtgc 



ggggactgca 
aggaggtcag 
gctactggcg 
agctgccctg 
tcctcacctc 
agaacccctc 
ctggaagtct 
cacatggagg 
ggccctttcc 
agttcctttc 
gaaattcgag 
ttgaaaaacg 
aaggcacagg 
gagactgatg 
cttcagctgg 



agtggagccc 
cggcgccctg 
cctcgaccag 
gatgggctgc 
agaaactggt 
ttagaagcct 
gtctgccttt 
tgggggtggg 
tccgaggact 
aagaggtaac 
accattttgc 
gtgtcatatt 
cgtctccaac 
caggcctacc 
tgggccgaga 



tactaccccc 
ccgaggcccg 
gccaaactgc 
tggcatgcca 
ggtgctctca 
ctgagtccct 
gttccttgaa 
atcaatctta 
tctatcctcc 
agcactggga 
aagactgtgc 
tagtcagagg 
ctcagaggcc 
agtccctggc 
ggcaggaagc 



gaagtctgca 
atggctccat 
aggcaaccac 
actcggggag 
gggcaaaatc 
ctgcagaaga 
gaatgcagca 
ggaaaagcaa 
ccaggccttt 
tccaagcagg 
ccttctcctc 
ccccaccccc 
ctttgtgggg 
tttttgtctg 
agcctt 



<211> 3291 

<212> DNA 

<213> Homo sapiens 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2516 



<400> 391 

aagtcgcgac 

gcgcagtggg 

tactcgcaca 

cagaactggc 

cacttcttgg 

ttgctgcagc 

acccagttgg 

gctcagaggg 

caacatgaga 

tccatcagcc 

aaagggaaga 

actctcaatg 

ggccgattaa 

cagcaaaaag 

ttcacagctg 

ctgagttgcc 

gcccaggtca 

ccctgcatgc 

gtccgaacaa 

tccattgaca 

aaccagaaaa 

taccfcgactc 

ctaggtgtga 

ctgaagcagg 

ctctcaattg 

cagcagttct 

tggcagttct 

aacaagctgt 

actaagcgag 

gagttggtac 

agtcggagcc 

ttcagtgaat 

aaggtgctca 

actgaaatca 

cgcttcctct 

gttaatctcc 

caggtggatg 

gagctggaac 

aaggcagggc 

gtgatagagc 



cagagccatt 

aaatgctgca 

gcctcctgcc 

aggaagctgc 

atcagctgaa 

acaatttgcg 

c t gaga t gat 

cccaattgga 

ttgaatcccg 

aactgaaaga 

caccctctct 

aactggacaa 

ctaccctaat 

cctgcatcag 

gagcaaagct 

tggttagcta 

cagagttgct 

cccaaactcc 

ggctgctggt 

ggaatcctcc 

ctttgacccc 

tggtggagca 

cagaggaact 

agctgaaaac 

cctgggcttc 

tctccaaccc 

cctcctatgt 

tcgggcagaa 

agagccctcc 

atgaccacct 

aggagcgccg 

cgtcagaagg 

tctactctgt 

tccgccatta 

atccccgaat 

aggaacggag 

aactgcaaca 

tagggctggt 

tggatctggg 

ccacactatg 



ggagggcgcg 

gaatcttgac 

tgtggacatt 

acttgggagt 

ctatgagtgt 

gaaattctgc 

ctttaacctc 

acaaggagag 

gatcctggat 

ccagcaggat 

ggacccccat 

aaggagaaag 

cgagctactg 

agctcccatt 

gttgtttcac 

tcaggatgac 

acagcgtctg 

ccatcgaccc 

gagactccag 

tcaattacaa 

cgagaagggg 

acgttcaggt 

gcacatcatc 

ggacaccctc 

agttctctgg 

ccccaaggcc 

tggccgaggc 

ctgtaggact 

tggcaagtta 

gaaggatctc 

gctgctgaag 

gggcattacc 

gcaaccgtac 

ccagttgctc 

cccccgggat 

gaaatacctg 

accgctggag 

gccagagcca 

gccagagcta 

catggtatca 



gggactgcaa 

agcccctttc 

cgacagtact 

gatgattcca 

ggccgttgca 

cgggacattc 

cttctggaag 

ccagttctcg 

ttaagggcta 

gtcttctgct 

cagaccaaag 

gaggtgctgg 

ctgccaaagt 

gaccacgggt 

ctgaggcagc 

cctctgacca 

ctccacagag 

ctcatcctca 

gaaggcaatg 

ggcttccgga 

cagagtcagg 

ggttcaggaa 

agcttcacgg 

cctgtggtga 

ttcaatttgc 

ccctggagct 

ctcaactcag 

gaggatccat 

ccattctgga 

tggaatgatg 

aagaccatgt 

tgctcctggg 

acgaaggagg 

actgaggaga 

gaagcttttg 

aaacacaggc 

cttaagccag 

gagctcagcc 

gagtctgtgc 

caaacagtgc 



ccctaatcag 

aggatcagct 

tggctgtctg 

aggctaccat 

gccaggaccc 

agcccttttc 

aaaaaagaat 

aaacacctgt 

tgatggagaa 

tccgatataa 

agcagaagat 

atgcctccaa 

tggaggagtg 

tggaacagct 

tgctgaagga 

aaggggtgga 

cctttgtggt 

agactggcag 

agtcactgac 

agttcaacat 

gtttgatttg 

agggcagcaa 

tcaaatatac 

ttatttccaa 

tcagcccaaa 

tgctgggccc 

accagctgag 

tattgtcctg 

catggctgga 

gacgcatcat 

ctggcacctt 

tggagcacca 

tgctgcagtc 

atatacctga 

ggtgctacta 

tcattgtggt 

agccagagct 

tggacttaga 

tggagtccac 

cagagccaga 



dU 



agcccaaatg 60 

gcaccagctt 120 

gattgaagac 180 

gctattcttc 240 

agagtccttg 300 
ccaggatcct 360 

tttgatccag 420 
ggagagccag 480 
gctggtaaaa 540 
gatccaggcc 600 
t c t gc agg aa 660 
agcactgcta 720 
gaaggcccag 780 
ggagacatgg 840 
gctgaaggga 900 
cctacgcaac 960 

agaaacccag 1020 

caagttcacc 1080. 

tgtggaagtc 11 

tctgacttca 12 

ggactttggt 1260 

taaggggcca 1320 

ctaccagggt 1380 

catgaaccag 1440 

ccttcagaac 1500 

tgctctcagt 1560 

catgctgaga 1620 

ggctgacttc 1680 

caaaattctg 1740 

gggctttgtg 1B00 

tctactgcgc 1860 

ggatgatgac 1920 

actcccgctg 1980 

aaacccactg 2040 

ccaggagaaa 2100 

ctctaataga 2160 

ggagtcatta 2220 

gccactgctg 2280 

tc tggagcc t 2340 

ccaaggacct 2400 
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gtatcacagc cagtgccaga gccagatttg ccctgtgatc tgagacattt gaacactgag 2460 

ccaatggaaa tcttcagaaa ctgtgtaaag attgaagaaa tcatgccgaa tggtgaccca 252 0 

ctgttggctg gccagaacac cgtggatgag gtttacgtct cccgccccag ccacttctac 2580 

actgatggac ccttgatgcc ttctgacttc taggaaccac atttcctctg ttcttttcat 264 0 

atctctttgc ccttcctact cctcatagca tgatattgtt ctccaaggat gggaatcagg 2700 

catgtgtccc ttccaagctg tgttaactgt tcaaactcag gcctgtgtga ctccattggg 2760 

gtgagaggtg aaagcataac atgggtacag aggggacaac aatgaatcag aacagatgct 2820 

gagccatagg tctaaatagg atcctggagg ctgcctgctg tgctgggagg tataggggtc 2880 

ctgggggcag gccagggcag ttgacaggta cttggagggc tcagggcagt ggcttctttc 2940 

cagtatggaa ggatttcaac attttaatag ttggttaggc taaactggtg catactggca 3 000 

ttggccttgg tggggagcac agacacagga taggactcca tttctttctt ccattccttc 3060 

atgtctagga taacttgctt tcttctttcc. tttactcctg gctcaagccc tgaatttctt 3120 

cttttcctgc aggggttgag agctttctgc cttagcctac catgtgaaac tctaccctga 3180 

agaaagggat ggataggaag tagacctctt tttcttacca gtctcctccc ctactctgcc 3240 

ccctaagctg gctgtacctg ttcctccccc ataaaatgat cctgccaatc t 3291 

<211> 1283 

<212> DNA 

<213> Homo sapiens 



<400> 392 
ctctctgctc 
catcgctcag 
cgcctggtca 
cccttcattg 
ttccatggca 
atcttccagg 
gtggagtcca 
gccaaaaggg 
aaccatgaga 
tgcttagcac 
accacagtcc 
tggcgtgatg 
gctgtgggca 
cccactgcca 
gatgacatca 
tacactgagc 
gacgctgggg 
aacgaatttg 
taagacccct 
ctggggagtc 
ccatgtagac 
aataaagtac 
<210> 393 



ctcctgttcg 
acaccatggg 
ccagggctgc 
acctcaacta 
ccgtcaaggc 
agcgagatcc 
ctggcgtctt 
tcatcatctc 
agtatgacaa 
ccctggccaa 
atgccatcac 
gccgcggggc 
aggtcatccc 
acgtgtcagt 
agaaggtggt 
accaggtggt 
ctggcattgc 
gctacagcaa 
ggaccaccag 
cctgccacac 
cccttgaaga 
cctgtgctca 



acagtcagcc 
gaaggtgaag 
ttttaactct 
catggtttac 
tgagaacggg 
ctccaaaatc 
caccaccatg 
tgccccctct 
cagcctcaag 
ggtcatccat 
tgccacccag 
tctccagaac 
tgagctgaac 
ggtggacctg 
gaagcaggcg 
ctcctctgac 
cctcaacgac 
cagggtggtg 
ccccagcaag 
tcagtccccc 
ggggaggggc 
acc 



gcatcttctt 
gtcggagtca 
ggtaaagtgg 
atgttccaat 
aagcttgtca 
aagtggggcg 
gagaaggctg 
gctgatgccc 
atcatcagca 
gacaactttg 
aagactgtgg 
atcatccctg 
gggaagctca 
acctgccgtc 
tcggagggcc 
ttcaacagcg 
cactttgtca 
gacctcatgg 
agcacaagag 
accacactga 
ctagggagcc 



ttgcgtcgcc 
acggatttgg 
atattgttgc 
atgattccac 
tcaatggaaa 
atgctggcgc 
gggctcattt 
ccatgttcgt 
atgcctcctg 
gtatcgtgga 
atggcccctc 
cctctactgg 
ctggcatggc 
tagaaaaacc 
ccctcaaggg 
acacccactc 
agctcatttc 
cccacatggc 
gaagagagag 
atctcccctc 
gcaccttgtc 



agccgagcca 
tcgtattggg 
catcaatgac 
ccatggcaaa 
tcccatcacc 
tgagtacgtc 
gcagggggga 
catgggtgtg 
caccaccaac 
aggactcatg 
cgggaaactg 
cgctgccaag 
cttccgtgtc 
tgccaaatat 
catcctgggc 
ctccaccttt 
ctggtatgac 
ctccaaggag 
accctcactg 
ctcacagttg 
atgtaccatc 



60 
120 
180 
240 
300 
I- 360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1283 



<211> 331 
<212> PRT 



<213> Homo sapiens 



<400> 393 

Met Gly Gly Ser Ala Gly Arg Glu 
1 5 
Leu Thr Arg Thr Gin Ser Ala Phe 
20 

Phe Thr Gly Glu Thr Val Ser Leu 
35 40 



Leu Asp Ala Gly Arg Lys Pro Lys 

10 " 15 

Ser Pro Val Ser Phe Ser Pro Leu 
25 30 
Val Asp Val Asp He Ser Gin Arg 
45 
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Gly Leu 


Thr 


Ser 


Pro 


His 


Pro 


Pro Thr 


Pro Pro 


Pro 


Pro 


Pro Arg Arg 


50 










55 






60 






Ser Leu 


Ser 


Leu 


Leu 


Asp 


Asp 


He Ser Gly Thr 


Leu 


Pro 


Thr Ser Val 


65 








70 






75 






80 


Leu Val 


Ala 


Pro 


Met 


Gly 


Ser 


Ser Leu 


Gin Ser 


Phe 


Pro 


Leu Pro Pro 








85 








90 






95 


Pro Pro 


Pro 


Pro 


His 


Ala 


Pro Asp Ala 


Phe Pro 


Arg 


He 


Ala Pro He 






100 








105 








110 


Arg Ala Ala Glu 


Ser 


Leu 


His 


Ser Gin 


Pro Pro 


Gin 


His 


Leu Gin Cys 




115 










120 






125 




Pro Leu 


Tyr Arg 


Pro Asp 


Ser 


Ser Ser 


Phe Ala 


Ala 


Ser Leu Arg Glu 


130 










135 






140 






Leu Glu 


Lys 


Cys 


Gly Trp Tyr Trp Gly Pro Met Asn Trp Glu Asp Ala 


145 








150 






155 






160 


Glu Met 


Lys 


Leu 


Lys 


Gly Lys 


Pro Asp Gly Ser 


Phe 


Leu 


Val Arg Asp 








165 








170 






175 


Ser Ser Asp 


Pro 


Arg 


Tyr 


He 


Leu Ser 


Leu Ser 


Phe 


Arg 


Ser Gin Gly 






180 








185 








190 


lie Thr 


His 


His 


Thr 


Arg 


Met 


Glu His 


Tyr Arg 


Gly 


Thr 


Phe Ser Leu 




195 










200 






205 




Trp Cys 


His 


Pro 


Lys 


Phe 


Glu 


Asp Arg 


Cys Gin 


Ser 


Val 


Val Glu Phe 


210 










215 






220 






lie Lys 


Arg 


Ala 


He 


Met 


His 


Ser Lys Asn Gly 


Lys 


Phe 


Leu Tyr Phe 


225 








230 






235 






240 


Leu Arg 


Ser Arg 


Val 


Pro Gly Leu Pro 


Pro Thr 


Pro 


Val 


Gin Leu Leu 






245 








250 






255 


Tyr Pro 


Val 


Ser 


Arg 


Phe 


Ser 


Asn Val 


Lys Ser 


Leu 


Gin 


His Leu Cys 




260 








265 








270 


Arg Phe 


Arg 


He 


Arg 


Gin 


Leu 


Val Arg 


He Asp His 


He 


Pro Asp Leu 




275 










280 






285 




Pro Leu 


Pro 


Lys 


Pro 


Leu 


He 


Ser Tyr 


He Arg 


Lys 


Phe 


Tyr Tyr Tyr 


290 










295 






300 






Asp Pro 


Gin 


Glu 


Glu 


Val 


Tyr 


Leu Ser 


Leu Lys 


Glu 


Ala 


Gin Leu He 


305 








310 






315 






320 


Ser Lys 


Gin 


Lys 


Gin 


Glu 


Val 


Glu Pro 


Ser Thr 












325 








330 








<210> 


3 94 




















<211> 


306 




















<212> 


PRT 




















<213> 


Homo 


sapiens 

















<400> 394 




Met 


Ala 


Ala 


Pro 


1 








Gly 


Trp 


Trp 


Ser 








20 


Val 


Pro 


Val 


Arg 






35 




Lys 


Phe 


Lys 


Thr 




50 






Leu 


Val 


He 


Pro 


65 








Thr 


Ala 


Gly He 


He 


Ser 


Ser 


Leu 








100 


Lys 


Lys 


Thr 


Met 






115 




Asp 


Ala 


Asn 


Phe 




130 







He Pro Gin Glv 
5 

Arg Gin Pro Val 

Thr Lys Lys Arg 
40 

Glu Lys Glu Phe 
55 

Pro Glu Lys Ser 
70 

Phe Asp Ala Tyr 
85 

Ser Lys Glu Gly 

Ala Ser Gin Val 
120 

Lys He Lys Asp 
135 



Phe Ser Cys Leu 
10 

Leu Val Thr Gin 
25 

Phe Thr Pro Pro 

Met Gin His Ala 
60 

Asp Arg Ser He 
75 

Val Pro Pro Glu 
90 

Leu He Glu Arg 
105 

Ser He Arg Arg 

Phe Pro Glu Lys 
140 



Ser Arg Val Leu 
15 

Ser Ala Ala He 
30 

He Tyr Gin Pro 
45 

Arg Lys Ala Gly 

His Leu Ala Cys 
80 

Gly Asp Ala Arg 
95 

Thr Glu Arg Met 
110 

He Lys Asp Tyr 
125 

Ala Lys Asp He 
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Phe 


lie 


Glu 


Ala 


His 


Leu 


Cys 


Leu 


145 










150 




His 


Thr 


Leu 


Val 


Thr 


Glu 


His 


Cys 










165 








Lys 


Tyr 


Lys 


Thr 


Val 


Arg 


Trp 


Ser 








180 










His 


Val 


Val 


Gin 


Val 


Arg 


Cys 


Ser 






195 










200 


Tyr 


Gly 


Gin 


He 


Thr 


Val 


Arg 


Met 




210 










215 




Tyr 


Asp 


Arg 


Phe 


Gly Arg Leu 


Met 


225 










230 






Asp 


Val 


Leu 


Glu 


Tyr 


Val 


Val 


Phe 










245 








Gly 


Ser 


Trp 


Arg Met 


His 


Thr 


Lys 








260 










Lys 


Gin 


Pro 


He 


Leu 


Lys 


Thr 


Val 






275 












Pro 


Glu 


Glu 


Glu Tyr Glu Glu 


Ala 




290 










295 




Leu 


Ala 














305 
















<210> 395 












<211> 557 
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70 










75 










80 


Asn 


Leu 


Thr 


Thr 


Phe 


Thr 


Gin 


Pro 


Cys 


Cys 


Glu 


Pro 


Cys 


Leu 


Pro 


Arg 



85 90 95 



Gly Cys 

<210> 400 

<211> 98 

<212> PRT 

<213> Homo sapiens 



<400> 400 












Met 


Asp 


Cys 


Cys 


Ala 


Ser Arg 


Ser 


1 








5 








Thr 


Thr 


He 


Cys 


Ser 


Ser 


Asp 


Lys 








20 










Pro 


Ser 


Thr 


Cys 


Pro 


His 


Thr 


Val 






35 










40 


Asp 


Asn 


Cys 


Pro 


Pro 


Pro 


Cys 


His 




50 










55 




Cys 


Phe 


Leu 


Leu 


Asn 


Ser 


Cys 


Gin 



65 70 



Cys Ser Val Pro Thr Gly Pro Ala 

10 15 
Ser Cys Arg Cys Gly Val Cys Leu 
25 ^ 30 

Trp Leu Leu Glu Pro He Cys Cys 
45 

He Pro Gin Pro Cys Val Pro Thr 
60 

Pro Thr Pro Gly Leu Glu Thr Leu 
75 80 
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Asn Leu Thr Thr Phe Thr Gin Pro Cys Cys Glu Pro Cys Leu Pro Arg 
85 90 95 

Gly Cys 

<210> 401 

<211> 79 

<212> PRT 

<213> Homo sapiens 



<400> 401 

Met Ser Cys Cys Asp Ser Tyr Leu Gin Gly Cys Cys Ser Val Pro Thr 

1 5 10 15 

Gly Leu Ala Thr Thr lie Cys Pro Ser Asp lie Ser Cys Gin Cys Glu 

20 25 30 

Val Cys Leu Pro Ser Thr Cys Pro His Glu lie Ser Leu Leu Gin Pro 

35 40 45 

Thr Cys Cys Glu Pro Gly Pro Cys Leu Ala Ala Cys Leu Thr Pro Met 

50 55 60 

Cys His Pro Val Asp Cys Ser Thr Asn Ala Thr Gin Leu Gin Pro 
65 70 75 

<210> 402 

<211> 98 

<212> PRT 

<213> Homo sapiens 



<400> 402 

Met Tyr Cys Cys Ala Leu Arg Ser Cys Ser Val Pro Thr Gly Pro Ala 

15 10 15 

Thr Thr Phe Cys Ser Phe Asp Lys Ser Cys Arg Cys Gly Val Cys Leu 

20 25 30 

Pro Ser Thr Cys Pro His Glu lie Ser Leu Leu Gin Pro lie Cys Cys 

35 40 45 

Asp Thr Cys Pro Pro Pro Cys Cys Lys Pro Asp Thr Tyr Val Pro Thr 

50 55 60 

Cys Trp Leu Leu Asn Asn Cys His Pro Thr Pro Gly Leu Ser Gly lie 
65 70 75 80 

Asn Leu Thr Thr Tyr Val Gin Pro Gly Cys Glu Ser Pro Cys Glu Pro 
85 90 95 

Arg Cys 

<210> 403 

<211> 174 

<212> PRT 

<213> Homo sapiens 



<400> 403 

Met Thr Cys Cys Gin Thr Ser Phe 
1 5 
Ser Gly Thr Cys Gly Ser Ser Cys 
20 



Cys Gly Tyr Pro Ser Phe Ser lie 

10 15 

Cys Gin Pro Ser Cys Cys Glu Thr 
25 30 
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Ser Cys 


Cys 


Gin Pro 


Arg 


Ser 


Cys 




35 








40 


Ser Phe 


Ser 


Thr Ser 


Gly 


Thr 


Cys 


50 








55 


Cys Cys 


Glu 


Thr Ser 


Cys 


Cys 


Gin 


65 






70 






Cys Gin 


Pro 


Ser Cys 


Cys 


Gin 


He 






85 








lie Gly 


Gly 


Gly lie 


Ser 


Tyr 


Gly 






100 








Ser Thr 


Arg 


lie Arg 


Trp 


Cys 


Arg 




115 








120 


Tyr Leu 


Pro 


Pro Cys 


Cys 


Val 


Val 


130 








135 




Gin Leu 


His 


His Ala 


Gin 


Ala 


Ser 


145 






150 






Gin Ser 


Cys 


Cys Arg 


Pro 


Val 


Cys 






165 








<210> 404 










<211> 


167 










<212> 1 


PRT 










<213> Homo 


sapiens 








<400> 404 










Met Thr 


Cys 


Cys Gin 


Thr 


Ser 


Phe 


1 




5 








Ser Gly 


Thr 


Cys Gly 


Ser 


Ser 


Cys 






20 








Ser Cys 


Cys 


Gin Pro 


Ser 


Cys 


Cys 




35 








40 


Ser Phe 


Ser 


Thr Ser 


Gly 


Thr 


Cys 


50 








55 




Cys Cys 


Glu 


Thr Ser 


Cys 


Cys 


Gin 


65 






70 




Cys Gly 


Thr 


Gly Cys 


Gly 


He 


Gly 






85 








Gly Ser 


Ser 


Gly Ala 


Val 


Ser 


Thr 






100 








Cys Arg 


Val 


Glu Gly 


Thr 


Cys 


Leu 




115 








120 


Thr Pro 


Pro 


Thr Cys 


Cys 


Gin 


Leu 


130 








135 




Arg Pro 


Ser 


Tyr Cys 


Gly 


Gin 


Ser 


145 






150 






Tyr Ser 


Cys 


Glu Pro 


Thr 


Cys 








165 








<210> 405 










<211> 177 










<212> PRT 










<213> Homo 


sapiens 








<400> 405 










Met Ala 


Cys 


Cys Gin 


Thr 


Ser 


Phe 


1 




5 








Ser Gly Thr 


Cys Gly 


Ser 


Ser 


Cys 



20 
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Gin 


Thr Ser 


Phe 


Cys 


Gly 


Phe 


Pro 








45 








Ser 


Ser Ser 


Cys 


Cys 


Gin 


Pro 


Ser 






60 










Pro 


Ser Cys 


Cys 


Glu 


Thr 


Ser 


Cys 




75 










80 


Ser 


Ser Cys 


Gly 


Thr 


Gly Cys 


Gly 




90 








95 




Gin Glu Gly 


Ser 


Ser 


Gly Ala Val 


105 








110 






Pro Asp Ser 


Arg 


Val 


Glu Gly Thr 








125 








Ser 


Cys Thr 


Pro 


Pro 


Car 


cys 


cys 






140 








Cys 


Cys Arg 


Pro 


Ser 


iyr 


cys 


\jXy 




155 










XoU 


Cys Cys Glii 


Pro 


Thr 


Cys 








170 












Cys 


Gly Tyr 


Pro 


Ser 


Cys 


Ser 


Thr 




10 








15 




Cys 


Gin Pro 


Ser 


Cys 


Cys 


Glu 


Thr 


25 








30 






Gin 


Thr Ser 


Phe 


Cys 


Gly 


Phe 


Pro 








45 








Ser 


Ser Ser 


Cys 


Cys 


Gin 


Pro 


Ser 






60 










Pro 


Ser Cys 


Cys 


Gin 


Thr 


Ser 


Ser 




75 










80 


Gly Gly He 


Gly 


Tyr 


Gly Gin Glu 




90 








95 




Arg 


He Arg 


Trp 


Cys 


Arg 


Pro Asp 


105 








110 






Pro 


Pro Cys 


Cys 


Val 


Val 


Ser 


Cys 








125 








His 


His Ala 


Glu 


Ala 


Ser 


Cys 


Cys 






140 








Cys 


Cys Arg 


Pro 


Val 


Cys 


Cys 


Cys 




155 










160 


Cys 


Gly Phe 


Pro 


Ser 


Cys 


Ser 


Thr 




10 






15 




Cys 


Gin Pro 


Ser 


Cys 


Cys 


Glu 


Thr 
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Ser Ser Cys 


Gin 


Pro 


Arg 


Cys 


Cys 


Glu 


Thr 


Ser 


Cys 


Cys 


Gin Pro 


Ser 


35 










40 










45 






Cys Cys Gin 


Thr 


Ser 


Phe 


Cys 


Gly 


Phe 


Pro 


Ser 


Phe 


Ser 


Thr Gly Gly 


50 








55 










60 








Thr Cys Asp 


Ser 


Ser 


Cys 


Cys 


Gin 


Pro 


Ser 


Cys 


Cys 


Glu 


Thr Ser 


Cys 


65 






70 










75 








80 


Cys Gin Pro 


Ser 


Cys 


Tyr 


Gin 


Thr 


Ser 


Ser 


Cys 


Gly Thr 


Gly Cys 


Gly 






85 










90 








95 




lie Gly Gly 


Gly He 


Gly 


Tyr Gly Gin Glu Gly Ser Ser 


Gly Ala Val 




100 










105 










110 




Ser Thr Arg 


He 


Arg 


Trp 


Cys 


Arg 


Pro Asp 


Cys 


Arg 


Val 


Glu Gly Thr 


115 










120 










125 






Cys Leu Pro 


Pro 


Cys 


Cys 


Val 


Val 


Ser 


Cys 


Thr 


Pro 


Pro 


w w x. uy a 


uys 


130 








X J 3 


















Gin Leu His 


His 


Ala 


Glu 


Ala 


Ser 


CVr 


Cys Arg 


XT J- \s 


Car* 

0*S Mm 






145 






150 










lob 








ibU 


Gin Ser Cys 


Cys 


Arg 


Pro 


Val 


Cys 


*«■» Jr Es 


Cys 


Tyr 




Oar 


f2l n t)*m 


i. XXX 






165 










170 








JL /o 




Cys 


























<210> 406 


























<211> 85 


























<212> PRT 


























<213> Homo 


sapiens 






















<400> 406 


























Val Thr Cys 


Val 


Pro 


Arg 


Cys 


Thr 


Arg 


Pro 


lie 


Cys 


Glu 


Pro Cys 


Arg 


1 




5 










10 








15 




Arg Pro Val 


Cys 


Cys 


Asp 


Pro 


Cys 


Ser 


Leu 


Gin 


Glu 


Gly 


Cys Cys 


Arg 




20 










25 










3 0 




Pro He Thr 


Cys 


Cys 


Pro 


Ser 


Ser 


Cys 


Thr 


Ala 


Val 


Val 


Cys Arg 


Pro 


35 










40 










A IT 

45 






Cys Cys Trp 


Ala 


Thr 


Thr 


Cys 


Cys 


Gin 


Pro 


Val 


q Ar 

OCI 


val 


Gin Ser 


Pro 


50 








55 










o u 








Cys Cys Arg 


Pro 


Pro 


Cys 


Gly Gin Pro Thr 


Pro 


v. jr o 




Thr Thr 


Cys 


65 






70 










75 






80 


Arg Thr Ser 


Ser 


Cys 
85 






















<210> 407 


























<211> 128 


























<212> PRT 


























<213> Homo 


sapiens 






















<400> 407 


























Met Thr Gly 


Ser 


Cys 


Cys 


Gly Ser Thr Leu 


Ser 


Ser 


Leu 


Ser Tyr Gly 


1 




5 










10 








15 




Gly Gly Cys 


Cys 


Gin 


Pro 


Cys 


Cys 


Cys 


Arg Asp 


Pro 


Cys 


Cys Cys 


Arg 




20 










25 










30 




Pro Val Thr 


Cys 


Gin 


Thr 


Thr 


Val 


Cys 


Arg 


Pro 


Val 


Thr 


Cys Val 


Pro 


35 










40 










45 






Arg Cys Thr 


Arg 


Pro 


He 


Cys 


Glu 


Pro 


Cys 


Arg 


Arg 


Pro 


Val Cys 


Cys 


50 








55 










60 








Asp Pro Cys 


Ser 


Leu 


Gin 


Glu Gly Cys 


Cys 


Arg 


Pro 


He 


Thr Cys 


Cys 


65 






70 










75 








80 


Pro Ser Ser 


Cys 


Thr 


Ala 


Val 


Val 


Cys 


Arg 


Pro 


Cys 


Cys 


Trp Ala 


Thr 






85 










90 








95 
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Thr Cys Cys Gin Pro Val Ser Val Gin Ser Pro Cys Cys Arg Pro Pro 

100 105 ~ no 

Cys Gly Gin Pro Thr Pro Cys Ser Thr Thr Cys Arg Thr Ser Ser Cys 

115 120 " 125 

<210> 408 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 408 

Met Glu Thr His Cys Thr Gly Arg Ser Ala Ser Phe Cys Ser Ser Ser 
15 10 15 

Ala lie Leu He 
20 

<210> 409 
<211> 210 
<212> PRT 
<213> Homo sapiens 



<400> 409 




























Met Val 


Ser 


Ser 


Cys 


Cys 


Gly Ser Val 


Cys 


Ser Asp Gin Gly Cys Gly 


1 






5 










10 










15 




Gin val 


Leu 


Cys 


Gin 


Glu 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Gin 


Thr 






20 










25 










30 






Thr Cys 


Cys 


Arg 


Thr 


Thr 


Cys 


Tyr Arg 


Pro 


Ser 


Cys 


Cys 


Val 


Ser 


Ser 




35 










40 










45 








Cys Cys 


Arg 


Pro 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 


Gin 


Pro 


Thr 


Cys 


50 










55 










60 








Cys Arg 
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Ser 


Cys 


Cys 


Glu 


Thr 


Thr 


Cys 


Cys 


His 


Pro Arg 


Cys 


Cys 


65 








70 










75 










80 


He Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Met 


Ser 


Ser 


Cys 


Cys 


Lys 








85 










90 










95 




Pro Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 


Gin 


Pro 


Thr 


Cys 


Cys 


Arg 


Pro 






100 










105 










110 




Ser Cys 


Cys 


He 


Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Val 


Ser 


Arg 




115 










120 










125 






Cys Cys 


Arg 


Pro 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 


Gin 


Pro 


Thr 


Cys 


130 










135 










140 








Cys Arg 


Pro 


Ser 


Cys 


Cys 


He 


Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


145 








150 










155 








160 


Glu Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


Cys 


Cys 


Cys 


Arg 


Pro 


Cys 


Cys 


Cys 


Leu 








165 










170 










175 




Arg Pro 


Val 


Cys Gly 


Arg Val 


Ser 


Cys 


His 


Thr 


Thr 


Cys 


Tyr Arg 


Pro 






180 










185 










190 






Thr Cys 


Val 


He 


Ser 


Thr 


Cys 


Pro Arg 


Pro 


Leu 


Cys 


Cys 


Ala 


Ser 


Ser 




195 










200 








205 








Cys Cys 






























210 






























<210> 410 




























<211> 195 




























<212> PRT 




























<213> Homo 
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<400> 410 












Met 


Val 


Asn 


Ser Cys 


Cys 


Gly 


Ser 


1 








5 








Gin Asp 


Leu 


Cys 


Gin 


Glu 


Thr 


Cys 








20 










Thr 


Cys 


Cys Arg Thr 


Thr 


Tyr 


Cys 






35 










40 


Cys 


Cys 


Arcr 


Pro 


Gin 


Cys 


Cys 


Gin 




50 










55 




Cvs 


Arcr 


Pro 


Arcr 


Cys 


Cys 


He 


Ser 


65 










70 






Val 


Ser 


Ser 


Cys 


Cys 


Lys 


Pro 


Gin 










85 








Pro 


Thr 


Cys 


Cys 


Arg 


Pro Arg 


Cys 








100 










Ser 


Cys 


Cys 


Val 


Ser 


Ser 


Cys 


Cys 






115 










120 


Cys 


Cys 


Gin 


Pro 


Thr 


Cys 


Cys 


His 




130 










135 




Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Glu 


Ser 


145 










150 






Leu 


Arg 


Pro 


Val 


Cys 


Gly Gly Val 










165 








Pro 


Thr 


Cys 


Val 


He 


Ser 


Ser 


Cys 








180 








Ser 


Cys 


Cys 
















195 
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<211> 201 












<212> PRT 
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sapiens 








<400> 411 












Met 


Val 


Asn 


Ser 


Cys 


Cys 


Gly Ser 


1 








5 








Leu 


Glu 


Asn 


Cys 


Cys 


Arg 


Pro 


Ser 








20 










Thr 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 






35 










40 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 




50 










55 




Cys 


Cys 


Gin 


Thr 


Thr 


Cys 


Cys Arg 


65 










70 






Cys 


Val 


Ser 


Ser 


Cys 


Cys 


Arg 


Pro 










85 








Gin 


Pro 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 








100 










Thr 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 






115 










120 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 




130 










135 




Cys 


Cys 


He 


Ser 


Ser 


Ser 


Cys 


Cys 


145 










150 






Cys 


Arg 


Pro 


Cys 


Cys 


Cys 


Leu Arg 
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His 


Thr 


Thr 


Cys 


Tyr 


Arg 


Pro 


Thr 








180 
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Leu 


Cys 
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Ser 


Ser 


Cys 






195 










200 
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Val 


Cys 


Ser 


His 


Gin 


Gly 


Cys 


Gly 




10 










15 




Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Glu 


Thr 


25 










30 






Arg 


Pro 


Ser 


Cys 


Cys 


Val 


Ser 


Ser 










45 








Ser 


Val 


Cys 


Cys 


Gin 


Pro 


Thr 


Cys 








60 










Ser 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 


Cys 






75 










80 


Cys 


Cys 


Gin 


Ser 


Met 


Cys 


Cys 


Gin 




90 










95 




Cys 


He 


Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


105 










110 






Arg 


Pro 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 










125 








Pro 


Ser 


Cys 


Ser 


He 


Ser 


Ser 


Cys 








140 










Ser 


Cys 


Cys 


Arg 


Pro 


Cys 


Cys 


Cys 






155 










160 


Ser 


Cys 


His 


Thr 


Thr 


Cys 


Tyr 


Arg 




170 










175 




Pro 


Arg 


Pro 


Leu 


Cys 


Cys 


Ala 


Ser 


185 










190 






Val 


Cys 


Ser 


Asp 


Gin 


Gly 


Cvs 


Gly 




10 










15 




Cys 


Cys 


Gin 


Thr 


Thr 


Cvs 


Cvs 


Arcr 


25 










30 






Cys 


Val 


Ser 


Ser 


Cys 


Cys 


Arq 


Pro 










45 








Gin 


Pro 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 








60 










Thr 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 






75 










80 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 


Cys 




90 










95 




Cys 


Cys 


Gin 


Thr 


Thr 


Cys 


Cys 


Arg 


105 
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Cys 


Val 


Ser 


Ser 


Cys 


Cys 


Arg 


Pro 










125 








Gin 


Pro 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 








140 










Pro 


Ser 


Cys 


Cys 


Glu 


Ser 


Ser 


Cys 






155 










160 


Pro 


Val 


Cys 


Gly 


Arg 


Val 


Ser 


Cys 




170 










175 




Cys 


Val 


He 


Ser 


Thr 


Cys 


Pro 


Arg 


185 
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Cys 
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<212> PRT 
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<400> 412 



Met 
1 


Val 


Ser 


Ser 


Cys 
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Cvs 


Glv 


Ser 


Leu 


Glu 


Asn 


Cys 


Cvs 


Arg 


Pro 










20 










Thr 


Thr 


Cys 


Cys 


Arg 


Pro 


Ser 


Cys 






35 










40 


Val 


Cys 


Tyr 


Gin 


Pro 


Thr 


Cys 


Cys 




50 










55 




Cys 


Cys 


His 


Pro 


Tyr 


Cys 


Cys 


Glu 


65 










70 






Cys 


Arg 


Pro 


Ser 


Cys 


Cys 


Gin 


Thr 










85 








Arg 


Thr 


Thr 


Cys 


Cys 


Cys 


Pro 


Ser 








100 










Pro 


Gin 


Cys 


Cys 


Gin 


Ser 


Val 


Cys 






115 










120 


Ser 


Cys 


Cys 
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Ser 


Ser 


Cys 


Cys 




130 










135 




Cys 


Cys 


Arg 


Pro 


Cys 
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Cys 


Val 


145 










150 
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Thr 


Thr 


Cys 


Tyr 


Arg 


Pro 










165 








Arg 


Pro 


Leu 


Cys 
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Ala 


Ser 


Ser 
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<400> 413 



Met Val 
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Cys 
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Cys Gly Ser 


Leu Glu 


Asn 


Cys 


Cys 


Arg Pro 


Ser 






20 








Thr Thr 


Cys 


Cys 


Arg 


Pro Ser 


Cys 




35 








40 


Gin Cys 


Cys 


Gin 


Ser 


Val Cys 


Cys 


50 








55 


Tyr Cys 


Val 


Ser 


Ser 


Cys Cys 


Arg 


65 








70 




Cys Arg 


Thr 


Thr 


Cys 


Cys Arg 


Pro 








85 






Arg Pro 
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Cys 


Gly Gin Ser 


Leu 






100 
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Val 


Ser 


Ser 


Glu 


wXXJi 




Cys 
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\jJLy 
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X mJ 


Cys 


Cys 


Gin 




XllX 


cys 
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Arg 
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Cys 


Lys 


Pro 


Gin 


Cys 


Cys 


Gin 


Ser 










45 








His 


Pro 


Ser 


Cys 


Cys 


lie 


Ser 


Ser 








60 










Ser 


Ser 


Cys 


Cys 


Arg 


Pro 


Cys 


Cys 






75 










80 


Thr 


Cys 


Cys 


Arg 


Thr 


Thr 


Cys 


Cys 




90 










95 




Cys 


Cys 


Val 


Ser 


Ser 


Cys 


Cys 


Arg 


105 










110 




Cys 


Gin 


Pro 


Thr 


Cys 


Cys 


Arg 


Pro 










125 








His 


Pro 


Ser 


Cys 
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Ser 








i a n 

JL*± u 










Aro 
fti -y 


Pro 


Val 


v-yt* 


vj x jf 


Arg 


val 


Ser 
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1 
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Thr 
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Val 


lie 

J. -L 


Car 
OCX. 


-L XIX 


v-ys 


Pro 




x / u 
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Thr 


Ser 


He 


Ser 


Tyr Arg Pro Ala 


Cys 


Tyr 


Arg 


Pro 


305 










310 






315 


320 


Cys 


Tyr 


Ser 


He 


Leu Arg 


Arg 


Pro 


Ala Cys Val Thr 


Ser 


Tyr 


Ser 


Cys 










325 








330 




335 


Arg 


Pro 


Val 


Tyr 


Phe 


Arg 


Pro 


Ser 


Cys Thr Glu Ser Asp 


Ser 


Cys 


Lys 








340 










345 




350 


Arg Asp 


Cys 


Lys 


Lys 


Ser 


Thr 


Ser 


Ser Gin Leu Asp 


Cys 


Val 


Asp 


Thr 






355 










360 


365 






Thr 


Pro 


Cys 


Lys 


Val 


Asp 


Val 


Ser Glu Glu Ala Pro 


Cys 


Gin 


Pro 


Thr 




370 










375 




380 
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Glu Ala 


Lys 


Pro 


He 


Ser 


Pro 


Thr 


385 








«3 27 U 






Pro Ala 


Ala 


Ser 


Lys 


Pro 


Ala 


Asn 








405 








<210> 424 












<211> 105 












<212> PRT 












<213> Homo 


sapiens 








<400> 424 












Met Gly 


Cys 


Cys 


Pro 


Gly 


Asp 


Cys 


1 






5 








Asn Cys 


Cys 


Glu 


Glu 


Cys 


Cys 


Cys 






20 










Gly Ser 


Cys 


Cys 


Gly 


Cys 


Gly 


Gly 




35 










40 


Gly Gly 


Ser 


Cys 


Cys 


Gly 


Ser 


Ser 


50 










55 




Cys Gly 


Gly 


Cys 


Gly 


Gly 


Cys 


Gly 


65 








70 






Cys Gly 


Ser 


Ser 


Cys 


Cys 


Gly 


Ser 








85 








Gin Pro 


Thr 


Pro 


He 


Cys 


Asp 


Thr 






100 










<210> 425 












<211> 404 












<212> PRT 












<213> Homo 


sapiens 








<400> 425 












Met Ser 


Tyr 


Ser 


Cys 


Gly 


Leu 


Pro 


1 






5 








Ser Ser 


Arg 


Pro 


Cys 


Val 


Pro 


Pro 






20 










Gly Ala 


Cys 


Asn 


lie 


Pro 


Ala 


Asn 




35 










40 


Glu Gly 


Ser 


Phe 


Asn 


Gly 


Ser 


Glu 


50 










55 




Asp Arg 


Leu 


Ala 


Ser 


Tyr 


Leu 


Glu 


65 








70 






Asn Ala 


Glu 


Leu 


Glu 


Asn 


Leu 


He 








85 








Pro Leu 


Val 


Cys 


Ala 


Ser 


Tyr 


Gin 






100 










Leu Gin 


Gin 


Lys 


lie 


Leu 


Cys 


Ser 




115 










120 


Val Gin 


He 


Asp 


Asn 


Ala 


Lys 


Leu 


130 










135 




Tyr Glu 


Thr 


Glu 


Leu 


Ser 


Leu 


Arg 


145 








150 






Gly Leu 


Arg 


Arg 


He 


Leu 


Asp 


Glu 








165 








Glu Ala 


Gin 


Val 


Glu 


Ser 


Leu 


Lys 






180 










Asn His 


Glu 


Gin 


Glu 


val 


Asn 


Thr 




195 










200 
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Thr 


Arg Glu Ala Ala 


Ala 


Ala 


Gin 




395 






400 


Cys 










Phe 


Thr Cys Cys Thr 


Gin 


Glu 


Gin 




10 




15 




Gin 


Pro Gly Cys Cys 


Gly Cys 


Cys 


25 




J u 






Ser 


Gly Cys Gly Gly 


Ser 


Gly Cys 




45 








Cys 


Cys Gly Ser Gly 


Cys 


Gly Gly 




60 








Gly 


Gly Cys Cys Gly 


Ser 


Ser 


Cys 




75 






80 


Gly 


Cys Cys Gly Pro 


Val 


Cys 


Cys 




90 




QC 




Lys 










105 










Ser 


Leu Ser Cys Arg 


Thr 


Ser 


Cys 




10 




15 




Ser 


Cys His Gly Cys 


Thr 


Leu 


Pro 


25 




30 






Val 


Ser Asn Cys Asn 


Trp 


Phe 


Cys 




45 








Lys 


Glu Thr Met Gin 


Phe 


Leu 


Asn 




60 








Lys 


Val Arg Gin Leu 


Glu 


Arg 


Asp 




75 






80 


Arg 


Glu Arg Ser Gin 


Gin 


Gin 


Glu 




90 




95 




Ser 


Tyr Phe Lys Thr 


He 


Glu 


Glu 


105 




110 






Lys 


Ser Glu Asn Ala 


Arg 


Leu 


Val 


Ala 


125 








Ser Asp Asp Phe 


Arg 


Thr 


Lys 




140 








Gin 


Leu Val Glu Ser 


Asp 


He 


Asn 




155 






160 


Leu 


Thr Leu Cys Arg 


Ser 


Asp 


Leu 




170 




175 




Glu 


Glu Leu Leu Cys 


Leu 


Lys 


Gin 


185 


190 




Leu 


Arg Cys Gin Leu 


Gly Asp Arg 



205 
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Leu 


Asn 


Val 


Glu 


Val 


Asp 


Ala 


Ala 




210 










215 




Leu 


Asn 


Glu 


Thr 


Arg 


Ser 


Gin 


Tyr 


225 










230 






Arg 


Glu 


Val 


Glu 


Gin 


Trp 


Phe 


Ala 










245 








Gin 


Val 


Val 


Ser 


Ser 


Ser 


Glu 


Gin 








260 










He 


Glu 


Leu 


Arg 


Arg 


Thr 


Val 


Asn 






275 










280 


Gin 


His 


Asn 


Leu 


Arg 


Asp 


Ser 


Leu 




290 










295 




Ala 


Atq 


Tvr 


Ser 


Ser 


Gin 


Leu 


Ser 


305 










310 






.Val 


Glu 


Ser 


Gin 


Leu 


Ala 


Glu 


He 










325 








Gin 


Glu 


Tyr 


Gin 


Val 


Leu 


Leu 


Asp 








340 










He 


Asn 


Thr 


Tyr 


Arg 


Ser 


Leu 


Leu 






355 










360 


Ser 


Asn 


Pro 


Cys 


Ala 


Thr 


Thr 


Asn 




370 










375 




Cys 


He 


Ser 


Asn 


Pro 


Cys 


Gly 


Leu 


385 










390 






Thr 


Phe 


Gly 


Tyr 










<210> 426 












<211> 404 












<212> PRT 












<213> Homo 


sapiens 








<400> 426 












Met 


Pro 


Tyr 


Asn 


Phe 


Cvs 


Leu 


Pro 


1 








5 








Ser 


Ser 


Arg 


Pro 


Cys 


Val 


Pro 


Pro 








20 










Gly 


Ala 


Cys 


Asn 


He 


Pro 


Ala 


Asn 






35 










40 


Glu 


Gly 


Ser 


Phe 


Asn 


Gly 


Ser 


Glu 




50 










55 




Asp 


Arg 


Leu 


Ala 


Ser 


Tyr 


Leu 


Glu 


65 










70 






Asn 


Ala 


Glu 


Leu 


Glu 


Asn 


Leu 


He 










85 








Pro 


Leu 


Leu 


Cys 


Pro 


Ser 


Tyr 


Gin 








100 










Leu 


Gin 


Gin 


Lys 


He 


Leu 


Cys 


Ser 






115 










120 


Val 


Gin 


He 


Asp 


Asn 


Ala 


Lys 


Leu 




130 










135 




Tyr 


Gin 


Thr 


Glu 


Gin 


Ser 


Leu 


Arg 


145 










150 






Ser 


Leu 


Arg 


Arg 


He 


Leu 


Asp 


Glu 










165 








Glu 


Ala 


Gin 


Met 


Glu 


Ser 


Leu 


Lys 








180 










Asn 


His 


Glu 


Gin 


Glu 


Val 


Asn 


Thr 






195 










200 


Leu 


Asn 


Val 


Glu 


Val 


Asp 


Ala 


Ala 




210 










215 
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Pro 


Thr 


Val 


Asp 


Leu 


Asn 


Gin 


Val 








220 










Glu 


Ala 


Leu 


Val 


Glu 


Thr 


Asn 


Arg 






235 










240 


Thr 


Gin 


Thr 


Glu 


Glu 


Leu 


Asn 


Lys 




250 










255 




Leu 


Gin 


Ser 


Tyr 


Gin 


Ala 


Glu 


He 


265 










270 






Ala 


Leu 


Glu 


He 


Glu 


Leu 


Gin 


Ala 










285 








Glu 


Asn 


Thr 


Leu 


Thr 


Glu 


Ser 


Glu 








300 










Gin 


Val 


Gin 


Arg 


Leu 


He 


Thr 


Asn 






315 










320 


Arg 


Ser 


Asp 


Leu 


Glu 


Arg 


Gin 


Asn 




330 










335 




Val 


Arg 


Ala 


Arg 


Leu 


Glu 


Cys 


Glu 


345 










350 






Glu 


Ser 


Glu 


Asp 


Cys 


Lys 


Leu 


Pro 










365 








Ala 


Cys 


Asp 


Lys 


Ser 


Thr 


Gly 


Pro 








380 










Arg 


Ala 


Arg 


Cys 


Gly 


Pro 


Cys 


Asn 






395 










400 


Ser 


Leu 


Ser 


Cvs 


Ara 


Thr 


Ser 


Cvs 




10 










15 




Ser 


Cvs 


His 


Gly 


Tvr 


Thr 


Leu 


Pro 


25 










30 






Val 


Ser 


Asn 


Cvs 


Asn 


Trp 


Phe 


Cys 










45 








Lys 


Glu 


Thr 


Met 


Gin 


Phe 


Leu 


Asn 








60 










Lys 


Val 


Arg 


Gin 


Leu 


Glu 


Arg 


Asp 






75 










80 


Arg 


Glu 


Arq 


Ser 


Gin 


Gin 


Gin 


Glu 




90 










95 




Ser 


Tyr 


Phe 


Lys 


Thr 


He 


Glu 


Glu 


105 










110 






Lys 


Ser 


Glu 


Asn 


Ala 


Arg 


Leu 


Val 










125 








Ala 


Ala 


Asp 


Asp 


Phe 


Arg 


Thr 


Lys 








140 










Gin 


Leu 


Val 


Glu 


Ser 


Asp 


He 


Asn 






155 










160 


Leu 


Thr 


Leu 


Cys 


Arg 


Ser 


Asp 


Leu 




170 










175 




Glu 


Glu 


Leu 


Leu 


Ser 


Leu 


Lys 


Gin 


185 










190 






Leu 


Arg 


Cys 


Gin 


Leu 


Gly 


Asp 


Arg 










205 








Pro 


Ala 


Val 


Asp 


Leu 


Asn 


Gin 


Val 



220 
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231 



Leu 


Asn 


Glu 


Thr 


Arg Asn Gin Tyr Glu Ala Leu Val 


Glu Thr 


Asn 


Arg 


225 










230 




235 






240 


Arg 


Glu 


Val 


Glu 


Gin 


Trp 


Phe 


Ala Thr Gin Thr Glu 


Glu Leu 


Asn 


Lys 


Gin 








245 






250 




255 


Val 


Val 


Ser 


Ser 


Ser 


Glu 


Gin Leu Gin Ser Tyr 


Gin Ala 


Glu 


He 


lie 






260 








265 


270 






Glu 


Leu Arg 


Arg 


Thr 


Val 


Asn Ala Leu Glu lie 


Glu Leu 


Gin 


Ala 






275 










280 


285 






Gin 


His 


Asn 


Leu 


Arg 


Tyr 


Ser 


Leu Glu Asn Thr Leu 


Thr Glu 


Ser 


Glu 




290 










295 


300 








Ala 


Arg 


Tyr 


Ser 


Ser 


Gin 


Leu 


Ser Gin Val Gin Ser 


Leu He 


Thr 


Asn 


305 










310 




315 






320 


Val 


Glu 


Ser 


Gin 


Leu 


Ala 


Glu 


He Arg Ser Asp Leu Glu Arg Gin Asn 


Gin 








325 






330 




335 




Glu 


Tyr Gin 


Val 


Leu 


Leu 


Asp Val Arg Ala Arg Leu Glu Cys Glu 


lie 






340 








345 


350 






Asn 


Thr 


Tyr 


Arg 


Ser 


Leu 


Leu Glu Ser Glu Asp 


Cys Lys 


Leu 


Pro 






355 










360 


365 






Ser 


Asn 


Pro 


Cys 


Ala 


Thr 


Thr 


Asn Ala Cys Glu Lys 


Pro He 


Gly 


Ser 




370 










375 


380 






Cys 


Val 


Thr Asn 


Pro 


Cys 


Gly 


Pro Arg Ser Arg Cys 


Gly Pro 


Cys 


Asn 


385 










390 




395 


400 


Thr 


Phe 


Gly 


Tyr 















<210> 427 

<211> 436 

<212> PRT 

<213> Homo sapiens 



<400> 427 

Met Leu Tyr Ala Lys Pro Pro Pro Thr He Asn Gly He Lys Gly Leu 

1 5 10 15 

Gin Arg Lys Glu Arg Leu Lys Pro Ala His He His Leu Gin Gin Leu 

20 25 30 

Thr Cys Phe Ser He Thr Cys Ser Ser Thr Met Ser Tyr Ser Cys Cys 

35 40 45 

Leu Pro Ser Leu Gly Cys Arg Thr Ser Cys Ser Ser Arg Pro Cys Val 

50 55 60 

Pro Pro Ser Cys His Gly Tyr Thr Leu Pro Gly Ala Cys Asn He Pro 
65 70 75 80 

Ala Asn Val Ser Asn Cys Asn Trp Phe Cys Glu Gly Ser Phe Asn Gly 

85 90 ~ 95 

Ser Glu Lys Glu Thr Met Gin Phe Leu Asn Asp Arg Leu Ala Ser Tyr 

100 105 " no 

Leu Glu Lys Val Arg Gin Leu Glu Arg Asp Asn Ala Glu Leu Glu Lys 

115 120 125 

Leu He Gin Glu Arg Ser Gin Gin Gin Glu Pro Leu Leu Cys Pro Ser 

130 135 140 

Tyr Gin Ser Tyr Phe Lys Thr He Glu Glu Leu Gin Gin Lys He Leu 
14 5 150 155 160 

Cys Ala Lys Ala Glu Asn Ala Arg Leu Val Val Asn He Asp Asn Ala 

165 170 175 

Lys Leu Ala Ser Asp Asp Phe Arg Ser Lys Tyr Gin Thr Glu Gin Ser 

180 185 190 

Leu Arg Leu Leu Val Glu Ser Asp He Asn Ser He Arg Arg He Leu 

195 200 205 

Asp Glu Leu Thr Leu Cys Lys Ser Asp Leu Glu Ser Gin Val Glu Ser 

210 215 220 

Leu Arg Glu Glu Leu He Cys Leu Lys Lys Asn His Glu Glu Glu Val 
225 230 235 240 



BHC 03 1 051 



Asn 


Thr 


Leu 


Arg 


Ser 


Gin 


Leu 


Gly 










245 








Thr 


Ala 


Pro 


Thr 


Val 


Asp 


Leu 


Asn 








260 










Gin 


Tyr 


Glu 


Ala 


Leu 


Val 


Glu 


He 






275 










280 


Phe 


Ala 


Thr 


Gin 


Thr 


Glu 


Glu 


Leu 




290 










295 




Glu 


Gin 


Leu 


Gin 


Ser 


Cys 


Gin 


Ala 


305 










310 






Val 


Asn 


Ala 


Leu 


Glu 


He 


Glu 


Leu 










325 








Ser 


Leu 


Glu 


Asn 


Thr 


Leu 


Thr 


Glu 








340 










Leu 


Ser 


Gin 


Val 


Gin 


Ser 


Leu 


He 






355 










360 


Glu 


He 


Arg 


Cys 


Asp 


Leu 


Glu 


Arg 




370 










375 




Leu 


Asp 


Val 


Arg 


Ala 


Arg 


Leu 


Glu 


385 










390 






Leu 


Leu 


Glu 


Ser 


Glu 


Asp 


Cys 


Lys 










405 








Thr 


Asn 


Ala 


Ser 


Gly 


Asn 


Ser 


Cys 








420 










Gly 


Cys 


Cys 


Asn 














435 












<210> 428 












<211> 416 












<212> PRT 












<213> Homo 


sapiens 








<400> 428 












Met 


Pro 


Tyr 


Asn 


Phe 


Cys 


Leu 


Pro 


1 








5 








Ser 


Ser 


Arg 


Pro 


Cys 


Val 


Pro 


Pro 








20 










Gly 


Ala 


Cys 


Asn 


He 


Pro 


Ala 


Asn 






35 










40 


Glu 


Gly 


Ser 


Phe 


Asn 


Gly 


Ser 


Glu 




50 










55 




Asp 


Arg 


Leu 


Ala 


Ser 


Tyr 


Leu 


Glu 


65 










70 






Asn 


Ala 


Glu 


Leu 


Glu 


Asn 


Leu 


He 










85 








Pro 


Leu 


Leu 


Cys 


Pro 


Ser 


Tyr 


Gin 








100 










Leu 


Gin 


Gin 


Lys 


He 


Leu 


Cys 


Thr 






115 










120 


Val 


Gin 


He 


Asp 


Asn 


Ala 


Lys 


Leu 




130 










135 




Tyr 


Gin 


Thr 


Glu 


Leu 


Ser 


Leu 


Arg 


145 










150 






Gly 


Leu 


Arg 


Arg 


He 


Leu 


Asp 


Glu 










165 








Glu 


Ala 


Gin 


Val 


Glu 


Ser 


Leu 


Lys 








180 










Asn 


His 


Glu 


Gin 


Glu 


Val 


Asn 


Thr 






195 










200 


Leu 


Asn 


Val 


Glu 


Val 


Asp 


Ala 


Ala 




210 










215 
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Asp 


Arg 


Leu 


Asn 


Val 


Glu 


Val 


Asp 




250 










255 




Gin 


Val 


Leu 


Asn 


Glu 


Thr 


Arg 


Ser 


265 










270 






Asn 


Arg 


Arg 


Glu 


Val 


Glu 


Gin 


Trp 










285 








Asn 


Lys 


Gin 


Val 


Val 


Ser 


Ser 


Ser 








300 










Glu 


He 


He 


Glu 


Leu 


Arg 


Arg 


Thr 






315 










320 


Gin 


Ala 


Gin 


His 


Asn 


Leu 


Arg 


Asp 




330 










335 




Ser 


Glu 


Ala 


His 


Tyr 


Ser 


Ser 


Gin 


345 










350 






Thr 


Asn 


Val 


Glu 


Ser 


Gin 


Leu 


Ala 










365 








Gin 


Asn 


Gin 


Glu 


Tyr 


Gin 


Val 


Leu 








380 










Cys 


Glu 


He 


Asn 


Thr 


Tyr 


Arg 


Ser 






395 










400 


Leu 


Pro 


Cys 


Asn 


Pro 


Cys 


Ala 


Thr 




410 










415 




Gly 


Pro 


Cys 


Gly 


Thr 


Ser 


Gin 


Lys 


425 










430 






Ser 


Leu 


Ser 


Cys 


Arg 


Thr 


Ser 


Cys 




10 










15 




Ser 


Cys 


His 


Ser 


Cys 


Thr 


Leu 


Pro 


25 










30 






Val 


Ser 


Asn 


Cys 


Asn 


Trp 


Phe 


Cys 










45 








Lys 


Glu 


Thr 


Met 


Gin 


Phe 


Leu 


Asn 








60 










Lys 


Val 


Arg 


Gin 


Leu 


Glu 


Arg 


Asp 






75 










80 


Arg 


Glu 


Arg 


Ser 


Gin 


Gin 


Gin 


Glu 




90 










95 




Ser 


Tyr 


Phe 


Lys 


Thr 


He 


Glu 


Glu 


105 










110 






Lys 


Ser 


Glu 


Asn 


Ala 


Arg 


Leu 


Val 










125 








Ala 


Ala 


Asp 


Asp 


Phe 


Arg 


Thr 


Lys 








140 










Gin 


Leu 


Val 


Glu 


Ser 


Asp 


He 


Asn 






155 










160 


Leu 


Thr 


Leu 


Cys 


Lys 


Ser 


Asp 


Leu 




170 










175 




Glu 


Glu 


Leu 


Leu 


Cys 


Leu 


Lys 


Ser 


185 










190 






Leu 


Arg 


Cys 


Gin 


Leu 


Gly 


Asp 


Arg 










205 








Pro 


Thr 


Val 


Asp 


Leu 


Asn 


Arg 


Val 



220 
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-233- 

Leu Asn Glu Thr Arg Ser Gin Tyr Glu Ala Leu Val Glu Thr Asn Aro 
225 230 235 240 

Arg Glu Val Glu Gin Trp Phe Thr Thr Gin Thr Glu Glu Leu Asn Lys 

245 250 255 

Gin Val Val Ser Ser Ser Glu Gin Leu Gin Ser Tyr Gin Ala Glu lie 

260 265 270 

He Glu Leu Arg Arg Thr Val Asn Ala Leu Glu He Glu Leu Gin Ala 

275 280 285 

Gin His Asn Leu Arg Asp Ser Leu Glu Asn Thr Leu Thr Glu Ser Glu 

290 295 300 

Ala Arg Tyr Ser Ser Gin Leu Ser Gin Val Gin Ser Leu He Thr Asn 
305 310 315 320 

Val Glu Ser Gin Leu Ala Glu He Arg Ser Asp Leu Glu Arg Gin Asn 

325 330 335 

Gin Glu Tyr Gin Val Leu Leu Asp Val Arg Ala Arg Leu Glu Cys Glu 

340 345 350 

He Asn Thr Tyr Arg Ser Leu Leu Glu Ser Glu Asp Cys Asn Leu Pro 

355 360 365 

Ser Asn Pro Cys Ala Thr Thr Asn Ala Cys Ser Lys Pro He Gly Pro 

370 375 380 

Cys Leu Ser Asn Pro Cys Thr Ser Cys Val Pro Pro Ala Pro Cys Thr 
385 390 395 400 

Pro Cys Ala Pro Arg Pro Arg Cys Gly Pro Cys Asn Ser Phe Val Arg 
405 410 415 

<210> 429 
<211> 201 
<212> PRT 
<213> Homo sapiens 



<400> 429 

Met Thr Ser Asp His Cys Ser Ser Leu Leu Ser Gly Gin Val Ser Glu 

1 5 io is 

Ala Asn Ala Ala Ser Leu Cys Leu Leu Ala Asn Val Ala His Ala Asn 

20 25 30 

Arg Val Arg Val Gly Ser Thr Pro Leu Gly Arg Leu Ser Leu Cys Leu 

35 40 45 

Pro Pro Thr Cys His Thr Thr Cys Pro Leu Pro Gly Thr Cys His He 

50 55 so 

Pro Gly Asn He Gly He Cys Gly Ala Tyr Arg Glu Asn Thr Leu Asn 
6 f . 70 75 80 

Gly His Glu Lys Glu Thr Met Gin Phe Leu Asn Asp Arg Leu Ala Asn 

85 90 - 95 

Tyr Leu Glu Lys Val Arg Gin Leu Glu Trp Asp Asn Ala Glu Leu Glu 

100 105 no 

Thr Lys Leu His Glu Arg Ser Lys Cys His Glu Ser Ser Val Cys Arg 

115 120 125 

Asn Tyr Gin Ser Tyr Phe Cys Thr He Gin Glu Leu Gin Gin Lys Val 

130 135 140 

Arg Phe Ala Val His Gin He Arg Gly Gin Glu Ser Ala Tyr Cys Leu 
i 45 , 150 155 160 

Ser Ala Lys Ser Gly Pro Pro Pro Ala Phe Ala Asn Lys Val Leu Leu 

165 170 175 

Val His Gly His Ala His Ala Phe Val Cys Cys Leu Gin Leu Leu Leu 

180 185 190 

Tyr Tyr Ser Gly Arg Val Gin Ser Leu 

195 200 
<210> 430 

<211> 471 



<212> PRT 
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<213> Homo sapiens 



<400> 430 



Met 


Thr 


Ser 


Phe 


Tyr 


Ser 


Thr 


Ser 


1 








5 








Ala 


Pro 


Gly 


Ala 


Arg 


Asn 


Val 


Phe 








20 










Gin 


Pro 


Val 


Ala 


Glu 


Ala 


Asn 


Ala 






35 










40 


Val 


Ala 


His 


Ala 


Asn 


Arg 


Val 


Arg 




50 










55 




Pro 


Ser 


Leu 


Cvs 

J 


Leu 


Pro 


Pro 


Thr 


65 










70 






Gly Thr 


Cvs 


His 


lie 


Pro 


Glv 


Asn 










85 








Lys 


Asn 


Thr 


Leu 


Asn 


Gly 


His 


Glu 








100 










Asp 


Arg 


Leu 


Ala 


Asn 


Tvr 


Leu 


Glu 






115 










120 


Asn 


Ala 


Glu 


Leu 


Glu 


Thr 


Thr 


Leu 




130 










135 




Ser 


Thr 


Val 


Cvs 


Pro 


Asp 


Tvr 


Gin 


145 










150 






Leu 


Gin 


Gin 


Lvs 


lie 


Leu 


Cvs 


Ser 










165 








Val 


Gin 


lie 
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Lys Asn Arg Tyr Cys Gly Gin Leu Gin Met He Gin Glu Gin He Ser 

405 410 415 

Asn Leu Glu Ala Gin He Thr Asp Val Arg Gin Glu He Glu Cys Gin 

420 425 430 

Asn Gin Glu Tyr Ser Leu Leu Leu Ser He Lys Met Arg Leu Glu Lys 

435 440 "* 445 

Glu He Glu Thr Tyr His Asn Leu Leu Glu Gly Gly Gin Glu Asp Phe 

450 455 460 

Glu Ser Ser Gly Ala Gly Lys He Gly Leu Gly Gly Arg Gly Gly Ser 
465 470 475 480 

Gly Gly Ser Tyr Gly Arg Gly Ser Arg Gly Gly Ser Gly Gly Ser Tyr 

485 490 495 

Gly Gly Gly Gly Ser Gly Gly Gly Tyr Gly Gly Gly Ser Gly Ser Arg 

500 505 510 

Gly Gly Ser Gly Gly Ser Tyr Gly Gly Gly Ser Gly Ser Gly Gly Gly 

515 520 525 

Ser Gly Gly Gly Tyr Gly Gly Gly Ser Gly Gly Gly His Ser Gly Gly 

530 535 540 

Ser Gly Gly Gly His Ser Gly Gly Ser Gly Gly Asn Tyr Gly Gly Gly 
545 550 555 ~ 560 

Ser Gly Ser Gly Gly Gly Ser Gly Gly Gly Tyr Gly Gly Gly Ser Gly 

565 570 575 

Ser Arg Gly Gly Ser Gly Gly Ser His Gly Gly Gly Ser Gly Phe Gly 

580 585 590 

Gly Glu Ser Gly Gly Ser Tyr Gly Gly Gly Glu Glu Ala Ser Gly Ser 

595 600 605 

Gly Gly Gly Tyr Gly Gly Gly Ser Gly Lys Ser Ser His Ser 

610 615 620 

<210> 439 

<211> 472 

<212> PRT 

<213> Homo sapiens 



<400> 439 




Met 
1 


Thr 


Thr Cys 


Ser Arg Gin Phe 
5 


Ser 


Cys 


Gly He 


Gly Gly Gly He 






20 




Ser 


Val 


Leu Ala 


Gly Gly Ser Cys 






35 


40 


Gly 


Leu 


Ser Val 


Ser Ser Ser Arg 




50 




55 


Leu 


Gly 


Gly Gly 


Tyr Gly Gly Gly 


65 






70 


Gly 


Ser 


Gly Phe 


Gly Gly Gly Tyr 








85 


Gly 


Gly 


Gly Phe 


Gly Gly Gly Phe 






100 




Gly 


Ser 


Glu Lys 


Val Thr Met Gin 






115 


120 


Tyr 


Leu 


Asp Lys 


Val Arg Ala Leu 




130 


135 



Thr Ser Ser Ser Ser Met Lys Gly 

10 15 
Gly Gly Gly Ser Ser Arg He Ser 
25 30 
Arg Ala Pro Ser Thr Tyr Gly Gly 
45 

Phe Ser Ser Gly Gly Ala Tyr Gly 
60 

Phe Ser Ser Ser Ser Ser Ser Phe 

75 80 
Gly Gly Gly Leu Gly Ala Gly Leu 

90 ~ 95 

Ala Gly Gly Asp Gly Leu Leu Val 
105 ~ no 

Asn Leu Asn Asp Arg Leu Ala Ser 
125 

Glu Glu Ala Asn Ala Asp Leu Glu 
140 
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Val Lys 


lie 


Arg 


Asp Trp 


Tyr 


Gin 


145 






150 






Asp Tyr 


Ser 


Pro 


Tyr Phe 


Lys 


Thr 








165 






Leu Thr 


Ala 


Thr 


Val Asp 


Asn 


Ala 






180 








Ala Arg 


Leu 


Ala 


Ala Asp 


Asp 


Phe 




195 








200 


Asn Leu 


Arg 


Met 


Ser Val 


Glu 


Ala 


210 








215 




Leu Asp Glu 


Leu 


Thr Leu 


Ala 


Arg 


225 






230 






Sex* Leu 


Lys 


Glu 


Glu Leu 


Ala 


Tyr 








245 






i"C *«- noil 


Ala 


Leu 


Arg Gly 


Gin 


Val 






260 








Asp Ala 


Ala 


Pro 


Gly Val 


Asp 


Leu 




275 








280 


Hop U1X1 


Tyr 


Glu 


Lys Met 


Ala 


Glu 


290 








295 




X X JL> Jf lie 


Phe 


Thr 


Lys Thr 


Glu 


Glu 








310 






O ■>»- /til ii 
OCI VJ1U 


Leu 


Val 


Gin Ser 


Gly 


Lys 








325 






Thr Met 


Gin 


Asn 


Leu Glu 


He 


Glu 






340 








Ala Ser 


-Leu 


Glu 


Asn Ser 


Leu 


Glu 




355 








360 


Gin Leu 


Ala 


Gin 


He Gin 


Glu 


Met 


370 








375 




Ala Gin 


Leu Arg 


Cys Glu 


Met 


Glu 


385 






390 






Leu Leu 


Asp 


Val 


Lys Thr 


Arg 


Leu 








405 






Arg Leu 


Leu 


Glu 


Gly Glu 


Asp 


Ala 






420 








Ser Gly 


Ser 


Gin 


Ser Ser 


Arg 


Asp 




435 








440 


Arg Thr 


Lys 


Val 


Met Asp 


Val 


His 


450 








455 




Glu Gin 


Val 


Leu 


Arg Thr 


Lys 


Asn 


465 






470 






<210> 440 










<211> 473 










<212> PRT 










<213> Homo 


sapiens 






<400> 440 










Met Thr 


Thr 


Cys 


Ser Arg 


Gin 


Phe 


1 






5 






Ser Cys 


Gly 


He 


Gly Gly 


Gly 


He 






20 








Ser Val 


Leu 


Ala 


Gly Gly 


Ser 


Cys 




35 








40 


Gly Leu 


Ser 


Val 


Ser Ser 


Arg 


Phe 


50 








55 




Gly Gly 


Gly Tyr 


Gly Gly 


Gly 


Phe 


65 






70 






Gly Phe 


Gly Gly 


Gly Tyr 


Gly 


Gly 



85 
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Arg 


Gin 


Arg 


Pro 


Ala 


Glu 


He 


Lys 






155 










160 


He 


Glu 


Asp 


Leu 


Arg 


Asn 


Lys 


He 




170 










175 




Asn 


Val 


Leu 


Leu 


Gin 


He 


Asp 


Asn 


185 










190 






Arg 


Thr 


Lys 


Tyr 


Glu 


Thr 


Glu 


Leu 










205 








Asp 


He 


Asn 


Gly 


Leu 


Arg 


Arg 


Val 








220 










Ala 


Asp 


Leu 


Glu 


Met 


Gin 


He 


Glu 




235 










240 


Leu 


Lys 


Lys' 


Asn 


His 


Glu 


Glu 


Glu 




250 










255 




Gly 


Gly 


Asp 


Val 


Asn 


Val 


Glu 


Met 


265 










270 






Ser 


Arg 


He 


Leu 


Asn 


Glu 


Met 


Arg 










285 








Lys 


Asn 


Arg 


Lys 


Asp 


Ala 


Glu 


Glu 








300 










Leu 


Asn 


Arg 


Glu 


Val 


Ala 


Thr 


Asn 






315 










320 


Ser 


Glu 


He 


Ser 


Glu 


Leu 


Arg 


Arg 




330 










335 




Leu 


Gin 


Ser 


Gin 


Leu 


Ser 


Met 


Lys 


345 










350 






Glu 


Thr 


Lys 


Gly 


Arg 


Tyr 


Cys 


Met 










365 








He 


Gly 


Ser 


Val 


Glu 


Glu 


Gin 


Leu 








380 










Gin 


Gin 


Asn 


Gin 


Glu 


Tyr 


Lys 


He 






395 










400 


Glu 


Gin 


Glu 


He 


Ala 


Thr 


Tyr 


Arg 




410 










415 




His 


Leu 


Ser 


Ser 


Ser 


Gin 


Phe 


Ser 


425 










430 






Val 


Thr 


Ser 


Ser 


Ser 


Arg 


Gin 


He 










445 








Asp 


Gly 


Lys 


Val 


Val 


Ser 


Thr 


His 








460 










Thr 


Ser 


Ser 


Ser 


Ser 


Met 


Lys 


Gly 




10 










15 




Gly 


Gly 


Gly 


Ser 


Ser 


Arg 


He 


Ser 


25 










30 






Arg 


Ala 


Pro 


Ser 


Thr 


Tyr 


Gly 


Gly 










45 








Ser 


Ser 


Gly 


Gly 


Ala 


Cys 


Gly 


Leu 








60 










Ser 


Ser 


Ser 


Ser 


Ser 


Phe 


Gly 


Ser 






75 










80 


Gly 


Leu 


Gly 


Ala 


Gly 


Phe 


Gly 


Gly 




90 










95 
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Gly 


Leu 


Gly Ala 


Gly Phe 


Gly Gly Gly Phe Ala 


Gly Gly Asp Gly Leu 








100 








105 








110 






Leu 


Val 


Gly 


Ser 


Glu 


Lys 


Val 


Thr Met 


Gin Asn 


Leu 


Asn 


Asp Arg Leu 






115 










120 






125 








Ala 


Ser 


Tyr 


Leu 


Asp 


Lys 


Val 


Arg Ala 


Leu Glu 


Glu 


Ala 


Asn 


Ala 


Asp 




130 










135 






140 








lieu 


Glu 


Val 


Lys 


He Arg 


Asp Trp Tyr Gin Arg 


Gin Arg 


Pro 


Ser 


Glu 


145 










150 






155 










160 


He 


Lys 


Asp 


Tyr 


Ser 


Pro 


Tyr 


Phe Lys 


Thr He 


Glu 


Asp 


Leu 


Arg Asn 










165 








170 








175 




Lys 


He 


He 


Ala 


Ala 


Thr 


He 


Glu Asn 


Ala Gin 


Pro 


He 


Leu 


Gin 


He 








180 








185 








190 






Asp 


Asn 


Ala 


Arg 


Leu 


Ala 


Ala 


Asp Asp 


Phe Arg 


Thr 


Lys 


Tyr 


Glu 


His 






195 










200 






205 






Glu 


Leu 


Ala 


Leu 


Arq 


Gin 


Thr 


Val Glu 


Ala Asp 


Val 


Asn Gly Leu Arg 




210 










215 




220 










Arg 


Val 


Leu Asp 


Glu 


Leu 


Thr 


Leu Ala 


Arg Thr Asp 


Leu 


Glu 


Met 


Gin 


225 










230 






235 










240 


He 


Glu 


Gly Leu 


Lys 


Glu 


Glu 


Leu Ala 


Tyr Leu 


Arg 


Lys 


Asn 


His 


Glu 










245 








250 




255 




Glu 


Glu 


Met 


Leu 


Ala 


Leu 


Arg Gly Gin Thr Gly Gly Asp Val 


Asn 


val 








260 








265 








270 






Glu 


Met 


Asp 


Ala 


Ala 


Pro 


Gly Val Asp 


Leu Ser 


Arg 


He 


Leu 


Asn 


Glu 






275 










280 




285 








Met 


Arg 


Asp 


Gin 


Tyr 


Glu 


Gin 


Met Ala 


Glu Lys 


Asn 


Arg 


Arg 


Asp 


Ala 




290 










295 






300 




Glu 


Thr 


Trp 


Phe 


Leu 


Ser 


Lys 


Thr Glu 


Glu Leu 


Asn 


Lys 


Glu 


Val 


Ala 


305 










310 






315 








320 


Ser 


Asn 


Ser 


Glu 


Leu 


Val 


Gin 


Ser Ser 


Arg Ser 


Glu 


Val 


Thr 


Glu 


Leu 










325 








330 








335 




Arg 


Arg 


Val 


Leu 


Gin Gly 


Leu 


Glu He 


Glu Leu 


Gin 


Ser 


Gin 


Leu 


Ser 








340 








345 








350 






Met 


Lys 


Ala 


Ser 


Leu 


Glu 


Asn 


Ser Leu 


Glu Glu 


Thr 


Lys 


Gly Arg Tyr 






355 










360 






365 








Cys 


Met 


Gin 


Leu 


Ser 


Gin 


He 


Gin Gly Leu He 


Gly Ser Val 


Glu 


Glu 




370 










375 






380 










Gin 


Leu 


Ala 


Gin 


Leu Arg 


Cys 


Glu Met 


Glu Gin 


Gin 


Ser 


Gin 


Glu 


Tyr 


385 










390 






395 










400 


Gin 


He 


Leu 


Leu 


Asp 


Val 


Lys 


Thr Arg 


Leu Glu 


Gin 


Glu 


He 


Ala 


Thr 










405 








410 








415 




Tyr 


Arg 


Arg Leu 


Leu 


Glu 


Gly Glu Asp Ala His 


Leu 


Ser 


Ser 


Gin 


Gin 








420 








425 








430 






Ala 


Ser 


Gly Gin 


Ser 


Tyr 


Ser 


Ser Arg 


Glu Val 


Phe 


Thr 


Ser 


Ser 


Ser 






435 










440 






445 








Ser 


Ser 


Ser 


Ser 


Arg 


Gin 


Thr 


Arg Pro 


He Leu 


Lys 


Glu 


Gin 


Ser 


Ser 




450 










455 






460 










Ser 


Ser 


Phe 


Ser 


Gin Gly 


Gin 


Ser Ser 














465 










470 


















<210> 441 
























<211> 432 
























<212> PRT 
























<213> Homo 


sapiens 





















<400> 441 

Met Thr Thr Ser He Arg Gin Phe 
1 5 
Ser Ser Gly Leu Gly Gly Gly Ser 
20 

Gly Gly Leu Gly Ala Gly Ser Cys 
35 " 40 



Thr Ser Ser Ser Ser He Lys Gly 

10 is 
Ser Arg Thr Ser Cys Arg Leu Ser 
25 30 
Arg Leu Gly Ser Ala Gly Gly Leu 
45 
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Gly Ser Thr Leu Gly Gly Ser Ser Tyr Ser Ser Cys Tyr Ser Phe Gly 

50 55 60 

Ser Gly Gly Gly Tyr Gly Ser Ser Phe Gly Gly Val Asp Gly Leu Leu 
65 " 70 75 80 

Ala Gly Gly Glu Lys Ala Thr Met Gin Asn Leu Asn Asp Arg Leu Ala 

85 90 95 

Ser Tyr Leu Asp Lys Val Arg Ala Leu Glu Glu Ala Asn Thr Glu Leu 

100 105 110 

Glu Val Lys lie Arg Asp Trp Tyr Gin Arg Gin Ala Pro Gly Pro Ala 

115 120 125 

Arg Asp Tyr Ser Gin Tyr Tyr Arg Thr lie Glu Glu Leu Gin Asn Lys 

130 135 140 

lie Leu Thr Ala Thr Val Asp Asn Ala Asn lie Leu Leu Gin He Asp 
145 150 155 160 

Asn Ala Arg Leu Ala Ala Asp Asp Phe Arg Thr Lys Phe Glu Thr Glu 

165 170 175 

Gin Ala Leu Arg Leu Ser Val Glu Ala Asp He Asn Gly Leu Arg Arg 

180 185 190 

Val Leu Asp Glu Leu Thr Leu Ala Arg Ala Asp Leu Glu Met Gin He 

195 200 205 

Glu Asn Leu Lys Glu Glu Leu Ala Tyr Leu Lys Lys Asn His Glu Glu 

210 " 215 220 

Glu Met Asn Ala Leu Arg Gly Gin Val Gly Gly Glu He Asn Val Glu 
225 230 235 240 

Met Asp Ala Ala Pro Gly Val Asp Leu Ser Arg He Leu Asn Glu Met 

245 250 255 

Arg Asp Gin Tyr Glu Lys Met Ala Glu Lys Asn Arg Lys Asp Ala Glu 

260 265 270 

Asp Trp Phe Phe Ser Lys Thr Glu Glu Leu Asn Arg Glu Val Ala Thr 

275 280 285 

Asn Ser Glu Leu Val Gin Ser Gly Lys Ser Glu He Ser Glu Leu Arg 

290 295 300 

Arg Thr Met Gin Ala Leu Glu He Glu Leu Gin Ser Gin Leu Ser Met 
305 310 315 320 

Lys Ala Ser Leu Glu Gly Asn Leu Ala Glu Thr Glu Asn Arg Tyr Cys 

325 330 335 

Val Gin Leu Ser Gin He Gin Gly Leu He Gly Ser Val Glu Glu Gin 

340 345 350 

Leu Ala Gin Leu Arg Cys Glu Met Glu Gin Gin Asn Gin Glu Tyr Lys 

355 360 365 

He Leu Leu Asp Val Lys Thr Arg Leu Glu Gin Glu He Ala Thr Tyr 

370 375 380 

Arg Arg Leu Leu Glu Gly Glu Asp Ala His Leu Thr Gin Tyr Lys Lys 
385 390 395 400 

Glu Pro Val Thr Thr Arg Gin Val Arg Thr He Val Glu Glu Val Gin 

405 410 415 

Asp Gly Lys Val He Ser Ser Arg Glu Gin. Val His Gin Thr Thr Arg 
420 425 430 

<210> 442 
<211> 469 
<212> PRT 
<213> Homo sapiens 



<400> 442 

Met Ser He His Phe Ser Ser Pro 
1 5 
Phe Ser Gly Arg Gly Ala Gin Val 
20 

Gly Leu Gly Ser Ser Ser Leu Tyr 
35 40 



Val Phe Thr Ser Arg Ser Ala Ala 

10 15 
Arg Leu Ser Ser Ala Arg Pro Gly 
25 30 
Gly Leu Gly Ala Ser Arg Pro Arg 
45 
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Val 


Ala 


Val 


Arg 


Ser 


Ala 


Tyr 


Gly 




50 










55 




Glu 


Val 


Thr 


He 


Asn 


Gin 


Ser 


Leu 


65 










70 






Asp 


Pro 


Ser 


Leu 


Gin 


Arg 


Val 


Arg 










85 








Thr 


Leu 


Asn 


Asn 


Lys 


Phe 


Ala 


Ser 








100 










Glu 


Gin 


Gin 


Asn 


Lys 


Leu 


Leu 


Glu 






115 










120 


Gin 


Lys 


Ser 


Ala 


Lys 


Ser 


Ser 


Arg 




130 










135 




lie 


Ala 


Gly 


Leu 


Arg 


Gly 


Gin 


Leu 


145 










150 






Arg 


Leu 


Glu 


Ala 


Glu 


Leu 


Arg 


Ser 










165 








Lys 


Asn 


Lys 


Tyr 


Glu 


Asp 


Glu 


He 








180 










Glu 


Phe 


Val 


Val 


Leu 


Lys 


Lys 


Asp 






195 










200 


Val 


Glu 


Leu 


Glu 


Ala 


Lys 


Val 


Asp 




210 










215 




Leu 


Arg 


Thr 


Leu 


Asn 


Glu 


Thr 


Glu 


225 










230 






Ser 


Asp 


Thr 


Ser 


Val 


Val 


Leu 


Ser 










245 








Leu 


Asp 


Gly 


He 


He 


Ala 


Glu 


Val 








260 










Lys 


Cys 


Ser 


Arg 


Ala 


Glu 


Ala 


Glu 






275 










280 


Thr 


Leu 


Gin 


Ala 


Gin 


Ala 


Gly 


Lys 




290 










295 


Arg 


Asn 


Glu 


He 


Ser 


Glu 


Met 


Asn 


305 










310 






Glu 


He 


Asp 


Asn 


He 


Lys 


Asn 


Gin 










325 








Ala 


Glu 


Ala 


Glu 


Glu 


Arg 


Gly 


Glu 








340 










Lys 


Gin 


Glu 


Glu 


Leu 


Glu 


Ala 


Ala 






355 










360 


Ala 


Arg 


Gin 


Leu 


Arg 


Glu 


Tyr 


Gin 




370 










375 




Leu 


Asp 


He 


Glu 


He 


Ala 


Thr 


Tyr 


385 










390 




Ser 


Arg 


Leu 


Ala 


Gly 


Asp 


Gly 


Val 










405 








Asn 


Ser 


Thr 


Gly 


Gly 


Ser 


Ser 


Ser 








420 










Gly 


Gly 


Thr 


Met 


Gly 


Ser 


Asn 


Ala 






435 










440 


Pro 


Gly 


Leu 


Leu 


Lys 


Ala 


Tyr 


Ser 




450 










455 




Arg 


Ser 


Ala 


Arg 


Asp 









465 
<210> 443 

<211> 486 
» 

<212> PRT 
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Gly Pro Val 


Gly 


Ala 


Gly 


He Arg 




60 










Leu Ala Pro 


Leu 


Arg 


Leu 


Asp Ala 


75 










80 


Gin Glu Glu 


Ser 


Glu 


Gin 


He 


Lys 


90 








95 


Phe He Asp 


Lys 


Val 


Arg 


Phe 


Leu 


105 






110 






Thr Lys Trp 


Thr 


Leu 


Leu 


Gin 


Glu 






125 








Leu Pro Asp 


He 


Phe 


Glu 


Ala 


Gin 




140 










Glu Ala Leu 


Gin 


Val 


Asp 


Gly Gly 


155 










160 


Met Gin Asp 


Val 


Val 


Glu 


Asp 


Phe 


170 








175 




Asn Arg Arg 


Thr 


Ala 


Ala 


Glu 


Asn 


185 






190 






Val Asp Ala 


Ala 


Tyr 


Met 


Ser 


Lys 






205 








Ala Leu Asn 


Asp 


Glu 


He 


Asn 


Phe 




220 










Leu Thr Glu 


Leu 


Gin 


Ser 


Gin 


He 


235 










240 


Met Asp Asn 


Ser 


Arg 


Ser 


Leu Asp 


250 








255 




Lys Ala Gin 


Tyr 


Glu 


Glu 


Met 


Ala 


265 






270 






Ala Trp Tyr 


Gin 


Thr 


Lys 


Phe 


Glu 






285 








His Gly Asp 


Asp 


Leu 


Arg 


Asn 


Thr 




300 










Arg Ala He 


Gin 


Arg 


Leu 


Gin 


Ala 


315 










320 


Arg Ala Lys 


Leu 


Glu 


Ala 


Ala 


He 


330 








335 




Leu Ala Leu 


Lys 


Asp 


Ala 


Arg Ala 


345 






350 






Leu Gin Arg 


Ala 


Lys 


Gin 


Asp 


Met 






365 








Glu Leu Met 


Ser 


Val 


Lys 


Leu 


Ala 




380 










Arg Lys Leu 


Leu 


Glu 


Gly 


Glu 


Glu 


395 










400 


Gly Ala Val 


Asn 


He 


Ser 


Val 


Met 


410 








415 




Gly Gly Gly 


He 


Gly 


Leu 


Thr 


Leu 


425 




430 






Leu Ser Phe 


Ser 


Ser 


Ser 


Ala Gly 






445 








He Arg Thr 


Ala 


Ser 


Ala 


Ser Arg 



460 
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<400> 443 




Met 


Thr 


Cys 


Gly 


1 








Ala 


Cys 




Pro 








20 


Arg Gly 


He 


Ser 






35 




Ser 


Val 


Cys 


Gly 




50 






Tyr 


Arg 


Ser 


Gly 


65 








Val 


Ser 


Val 


Asn 


Pro 


Asn 


Ala 


Gin 








100 


Leu 


Asn 


Ser 


Arg 






115 




Gin 


Gin 


Asn 


Lys 




130 






Glu 


Cvs 


Cys 


Gin 


145 








Thr 


Leu 


Arg 


Arg 


Ala 


Ser 


GJLU 


Leu 








180 


Lys 


Tyr 


Glu 


Glu 






195 




Val 


Ala 


Leu 


Lys 




210 






Leu 


Glu 


Ala 


Asn 


225 








Arg 


Leu 


Tyr 


Glu 


Thr 


Ser 


Val 


Val 








260 


Cys 


He 


He 


Ala 






275 




Ser Arg 


Ala 


Glu 




290 






Lys 


Ala 


Thr 


Val 


305 








Glu 


He 


Asn 


Glu 


Glu 


Asn 


Ala 


Lys 








340 


Ser 


Glu 


Gin 


Gin 






355 




Ala 


Glu 


Leu 


Glu 




370 






Leu 


He 


Arg 


Glu 


3B5 








He 


Glu 


He 


Ala 


Leu 


Cys 


pi 

LjAU 


Gly 








/ion 


Gly Gly 


val 


Val 






435 




Val 


Ser 


Thr 


Arg 




450 






Gly Thr 


Thr 


Asn 


465 








Gly Ser 


Cys 


Lys 


<210> 444 





Ser 


Tyr 


Cys 


Gly 


5 








Arg 


Pro 


Gly 


Arg 


Cys 


Tyr 


Axg 










40 


Gly 


Phe 


Arg 


Ala 






55 




Gly 


Val 


Cys 


Gly 




70 






Glu 


Ser 


Leu 


Leu 


85 








Cys 


Val 


Lys 


Gin 


Phe 


Ala: 


Ala 


Phe 








120 


Leu 


Leu 


Glu 


Thr 






135 




Ser 


Asn 


Leu 


Glu 




150 






Glu 


Ala 


Glu 


Cys 


165 








Asn 


His 


Val 


Gin 


Gr-lU 


Val 


Ser 


Leu 








200 


Lys 


Asp 


Val 


Asp 






215 




Val 


Glu 


Ala 


Leu 




230 






Glu 


Glu 


He 


Arg 


245 








Val 


Lys 


Leu 


Asp 


Glu 


He 


Lys 


Ala 








280 


Ala 


Glu 


Ser 


Trp 






295 




He 


Arg 


His 


Gly 




310 






Leu 


Asn 


Arg 


Met 


325 








Cys 


Gin 


Asn 


Ser 


Gly 


Glu 


Ala 


Ala 








360 


Gly 


Ala 


Leu 


Gin 






375 




Tyr 


Gin 


Glu 


Val 




390 






Thr 


Tyr 


Arg 


Arg 


405 








Val 


Gly 


Ser 


Val 


Cys 


Gly 


Asp 


Leu 








440 


Val 


Ser 


Ser 


Val 






455 




Ala 


Cys 


Ala 


Pro 




470 






Arg 


Cys 






485 
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Gly 


Arg 


Ala 


Phe 




10 






Cys 


Cys 


He 


Thr 


25 








Leu 


Thr 


Gly 


Gly 




Ser 


Cys 


vsJLy 








60 


Pro 


Ser 


Pro 


Pro 






75 




Thr 


Pro 


Leu 


Asn 




90 






Glu 


Glu 


Lys 


Glu 


105 








He 


Asp 


Lys 


Val 


Lys 


Leu 


Gin 


Pne 








140 


Pro 


Leu 


Phe 


Glu 






155 




Val 


Glu 


Ala 


Asp 




170 






Glu 


Val 


Leu 


Glu 


185 








Arg 


Ala 


Thr 


Ala 


Cys 


ax a 


- 

Tyr 


Leu 








220 


He 


Gin 


Glu 


He 






235 




Val 


Leu 


Gin 


Ser 




250 






Asn 


Ser 


Arg 


Asp 


265 








Gin 


Tyr 


Asp 


Asp 


Tyr 


Arg 


Ser 


Lys 








300 


Glu 


Thr 


Leu 


Arg 






315 




He 


Gin 


Arg 


Leu 




330 






Lys 


Leu 


Glu 


Ala 


345 








Leu 


Ser 


Asp 


Ala 


Lys 


TV 1 —i 

Ala 


Lys 


Gin 








360 


Met 


Asn 


Ser 


Lys 






395 




Leu 


Leu 


Glu 


Gly 




410 






Asn 


Val 


Cys 


Val 


425 








Cys 


Ala 


Ser 


Thr 


Pro 


Ser 


Asn 


Ser 








460 


Ser 


Ala 


Arg 


Val 



475 



Ser 


Cys 


He 


Ser 






15 




Ala 


Ala 


Pro 


Tyr 




30 






Phe 


Gly 


Ser 


His 


45 








Arg 


Ser 


Phe 


Gly 


Cys 


j. j.e 


Thr 


rpl_ 

lnr 








80 


Leu 


Glu 


He 


Asp 






95 




Gin 


He 


Lys 


Ser 




110 






Arg 


Phe 


Leu 


Glu 


125 








Tyr 


Gin 


Asn 


Arg 


Gly 


Tyr 


lie 


Glu 








160 


Ser 


Gly 


Arg 


Leu 






175 




Gly 


Tyr 


Lys 


Lys 




190 






Glu 


Asn 


Glu 


Phe 


205 








Arg 


Lys 


Ser 


Asp 


Asp 


Phe 


Leu 


Arg 








240 


His 


He 


Ser 


Asp 






255 




Leu 


Asn 


Met 


Asp 




270 






He 


Val 


Thr 


Arg 


285 








Cys 


Glu 


Glu 


Met 


Arg 


Thr 


Lys 


Glu 








320 


Thr 


Ala 


Glu 


Val 






335 




Ala 


Val 


Ala 


Gin 




350 






Arg 


Cys 


Lys 


Leu 


365 








Asp 


Met 


Ala 


Cys 


Leu 


Gly 


Leu 


Asp 








400 


Glu 


Glu 


Gin 


Arg 






415 




Ser 


Ser 


Ser 


Arg 




430 






Thr 


Ala 


Pro 


Val 


445 








Asn 


Val 


Val 


Val 


Gly 


Val 


Cys 


Gly 








480 



<211> ill 
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<212> PRT 

<213> Homo sapiens 



<400> 444 

Met Lys Ala Thr Val lie Trp His Gly Glu Thr Val Gly Cys Thr Lys 

15 10 is 

Glu Glu lie Lys Glu Leu Thr His Met lie Gin Arg Leu Met Ala Lys 

20 25 " 30 

Val Glu Asn Ala Lys Cys Gin Val Trp Gly He Cys Ala Gin Gly Gin 

35 40 45 

Arg Asp Leu Trp Pro Asn Leu Cys His Thr Ala Tyr Val Cys Pro Thr 

50 55 60 

Trp He Ser Ala Phe He Leu Gin Ser Leu Cys Pro Cys Arg Val Pro 
65 70 75 " ~ 80 

Gly Cys Gly Gin Ser Gly Ser Ala Arg Met Met Lys Ala Arg Gly Leu 

85 90 95 

Phe Leu Arg Cys Ser Gin Leu Asn Gly Arg Leu Asp He Phe Arg 
100 105 ^ no 

<210> 445 
<211> 505 
<212> PRT 
<213> Homo sapiens 



<400> 445 

Met Thr Cys Gly Ser Gly Phe Gly Gly Arg Ala Phe Ser Cys He Ser 

15 10 is 

Ala Cys Gly Pro Arg Pro Gly Arg Cys Cys He Thr Ala Ala Pro Tyr 

20 25 30 

Arg Gly He Ser Cys Tyr Arg Gly Leu Thr Gly Gly Phe Gly Ser His 

35 40 45 

Ser Val Cys Gly Gly Phe Arg Ala Gly Ser Cys Gly Arg Ser Phe Gly 

50 55 60 

Tyr Arg Ser Gly Gly Val Cys Gly Pro Ser Pro Pro Cys He Thr Thr 
65 70 75 80 

Val Ser Val Asn Glu Ser Leu Leu Thr Pro Leu Asn Leu Glu He Asp 

85 90 95 

Pro Asn Ala Gin Cys Val Lys Gin Glu Glu Lys Glu Gin He Lys Ser 

100 105 . no 

Leu Asn Ser Arg Phe Ala Ala Phe He Asp Lys Val Arg Phe Leu Glu 

115 120 125 

Gin Gin Asn Lys Leu Leu Glu Thr Lys Leu Gin Phe Tyr Gin Asn Arg 

130 135 140 

Glu Cys Cys Gin Ser Asn Leu Glu Pro Leu Phe Glu Gly Tyr He Glu 
* 45 150 155 ~ 160 

Thr Leu Arg Arg Glu Ala Glu Cys Val Glu Ala Asp Ser Gly Arg Leu 

165 170 ~ 175 

Ala Ser Glu Leu Asn His Val Gin Glu Val Leu Glu Gly Tyr Lys Lys 

180 185 190 ? 

Lys Tyr Glu Glu Glu Val Ser Leu Arg Ala Thr Ala Glu Asn Glu Phe 

195 200 205 

Val Ala Leu Lys Lys Asp Val Asp Cys Ala Tyr Leu Arg Lys Ser Asp 

210 215 220 

Leu Glu Ala Asn Val Glu Ala Leu He Gin Glu He Asp Phe Leu Arg 
225 230 235 240 

Arg Leu Tyr Glu Glu Glu He Arg He Leu Gin Ser His He Ser Asp 
245 250 255 
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Thr Ser Val Val Val Lys Leu Asp Asn Ser Arg Asp Leu Asn Met Asp 

260 265 270 

Cys lie lie Ala Glu lie Lys Ala Gin Tyr Asp Asp lie Val Thr Arg 

275 280 285 

Ser Arg Ala Glu Ala Glu Ser Trp Tyr Arg Ser Lys Cys Glu Glu Met 

290 295 300 

Lys Ala Thr Val lie Arg His Gly Glu Thr Leu Arg Arg Thr Lys Glu 
305 310 315 320 

Glu lie Asn Glu Leu Asn Arg Met He Gin Arg Leu Thr Ala Glu Val 

325 330 335 

Glu Asn Ala Lys Cys Gin Asn Ser Lys Leu Glu Ala Ala Val Ala Gin 

340 345 350 

Ser Glu Gin Gin Gly Glu Ala Ala Leu Ser Asp Ala Arg Cys Lys Leu 

355 360 365 

Ala Glu Leu Glu Gly Ala Leu Gin Lys Ala Lys Gin Asp Met Ala Cys 

370 375 380 

Leu He Arg Glu Tyr Gin Glu Val Met Asn Ser Lys Leu Gly Leu Asp 
385 390 395 400 

He Glu He Ala Thr Tyr Arg Arg Leu Leu Glu Gly Glu Glu Gin Arg 

405 410 415 

Leu Cys Glu Gly He Gly Ala Val Asn Val Cys Val Ser Ser Ser Arg 

420 425 430 

Gly Gly Val Val Cys Gly Asp Leu Cys Val Ser Gly Ser Arg Pro Val 

435 440 445 

Thr Gly Ser Val Cys Ser Ala Pro Cys Asn Gly Asn Val Ala Val Ser 

450 455 460 

Thr Gly Leu Cys Ala Pro Cys Gly Gin Leu Asn Thr Thr Cys Gly Gly 
465 470 475 480 

Gly Ser Cys Gly Val Gly Ser Cys Gly He Ser Ser Leu Gly Val Gly 

485 490 495 

Ser Cys Gly Ser Ser Cys Arg Lys Cys 
500 505 

<210> 446 
<211> 486 
<212> PRT 
<213> Homo sapiens 



<400> 446 






















Met 


Thr 


Cys Gly 


Ser 


Tyr 


Cys 


Gly Gly Arg Ala Phe Ser 


Cys 


He 


Ser 


1 






5 








10 








15 




Ala 


Cys 


Gly Pro 


Arg 


Pro 


Gly Arg Cys 


Cys 


He 


Thr Ala 


Ala 


Pro 


Tyr 






20 








25 








30 






Arg 


Gly 


He Ser 


Cys 


Tvr Arg Glv Leu Thr Gly Gly Phe 


Gly 


Ser 


His 






35 








40 






45 








Ser 


Val 


Cys Gly 


Gly 


Phe 


Arg Ala Gly 


Ser 


Cys 


Gly Arg 


Ser 


Phe 


Gly 




50 








55 








60 








Tyr 


Arg 


Ser Gly 


Gly Val 


Cys 


Gly Pro 


Ser 


Pro 


Pro Cys 


He 


Thr 


Thr 


65 








70 








75 








80 


Val 


Ser 


Val Asn 


Glu 


Ser 


Leu 


Leu Thr 


Pro 


Leu 


Asn Leu 


Glu 


He 


Asp 








85 








90 








95 




Pro 


Asn 


Ala Gin 


Cys 


Val 


Lys 


Gin Glu 


Glu 


Lys 


Glu Gin 


He 


Lys 


Ser 






100 








105 








110 






Leu 


Asn 


Ser Arg 


Phe 


Ala 


Ala 


Phe He 


Asp 


Lys 


Val Arg 


Phe 


Leu 


Glu 






115 








120 






125 








Gin 


Gin 


Asn Lys 


Leu 


Leu 


Glu 


Thr Lys 


Leu 


Gin 


Phe Tyr 


Gin 


Asn 


Arg 




130 






135 








140 








Glu 


Cys 


Cys Gin 


Ser 


Asn 


Leu 


Glu Pro 


Leu 


Phe Glu Gly Tyr 


He 


Glu 


145 








150 








155 








160 


Thr 


Leu 


Arg Arg 


Glu 


Ala 


Glu 


Cys Val 


Glu 


Ala 


Asp Ser Gly Arg Leu 



165 170 175 
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Ala 


Ser 


Glu 


Leu 


Asn 


His 


Val 


Gin 








180 










Lys 


Tyr 


Glu 


Glu 


Glu 


Val 


Ser 


Leu 






195 










200 


Val 


Ala 


Leu 


Lys 


Lys 


Asp 


Val 


Asp 




210 










215 




Leu 


Glu 


Ala 


Asn 


Val 


Glu 


Ala 


Leu 


225 










230 






Arg 


Leu 


Tyr 


Glu 


Glu 


Glu 


He 


Arg 










245 








Thr 


Ser 


Val 


Val 


Val 


Lys 


Leu 


Asp 








260 










Cys 


He 


He 


Ala 


Glu 


He 


Lys 


Ala 






275 










280 


Ser 


Arg 


Ala 


Glu 


Ala 


Glu 


Ser 


Trp 




290 










295 




Lys 


Ala 


Thr 


Val 


He 


Arg 


His 


Gly 


305 










310 






Glu 


He 


Asn 


Glu 


Leu 


Asn 


Arg 


Met 










325 








Glu 


Asn 


Ala 


Lys 


Cys 


Gin 


Asn 


Ser 








340 










Ser 


Glu 


Gin 


Gin Gly 


Glu 


Ala 


Ala 






355 










360 


Ala 


Glu 


Leu 


Glu Gly 


Ala 


Leu 


Gin 




370 










375 




Leu 


lie 


Arg 


Glu Tyr 


Gin 


Glu 


Val 


385 










390 






He 


Glu 


He 


Ala 


Thr 


Tyr 


Arg 


Arg 










405 








Leu 


Cys 


Glu 


Gly Val 


Gly 


Ser 


val 








420 










Gly 


Gly 


Val 


Val 


Cys 


Gly 


Asp 


Leu 






435 










440 


Val 


Ser 


Thr 


Arg Val 


Ser 


Ser 


Val 




450 










455 




Gly 


Thr 


Thr 


Asn 


Ala 


Cys 


Ala 


Pro 


465 










470 






Gly 


Ser 


Cys 


Lys 


Arg 


Cys 














485 








<210> 447 












<211> 493 












<212> PRT 












<213> Homo 


sapiens 








<400> 447 












Met 


Thr 


Cys 


Gly 


Phe 


Asn 


Ser 


He 


1 








5 








Phe 


Ser 


Cys 


Val 


Ser 


Ala 


Cys 


Gly 








20 










Thr 


Ala 


Ala 


Pro 


Tyr 


Arg 


Gly 


He 






35 










40 


Gly 


Phe 


Gly 


Ser 


His 


Ser 


Val 


Cys 




50 










55 




Gly 


Arg 


Ser 


Phe 


Gly 


Tyr 


Arg 


Ser 


65 










70 






Pro 


Cys 


He 


Thr 


Thr 


Val 


Ser 


Val 










85 








Asn 


Leu 


Glu 


He 


Asp 


Pro 


Asn 


Ala 



100 
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Glu 


Val 


Leu 


Glu 


Gly 


Tyr 


Lys 


Lys 


185 










190 






Arg 


Ala 


Thr 


Ala 


Glu 


Asn 


Glu 


Phe 










205 








Cys 


Ala 


Tyr 


Leu 


Arg 


Lys 


Ser 


Asp 








220 










He 


Gin 


Glu 


He 


Asp 


Phe 


Leu 


Arg 






235 










240 


Val 


Leu 


Gin 


Ser 


His 


He 


Ser 


Asp 




250 










255 


Asn 


Ser 


Arg 


Asp 


Leu 


Asn 


Met 


As P 


265 










270 






Gin 


Tyr 


Asp 


Asp 


He 


Val 


Thr 


Arg 










285 








Tyr 


Arg 


Ser 


Lys 


Cys 


Glu 


Glu 


Met 








300 










Glu 


Thr 


Leu 


Arg 


Arg 


Thr 


Lys 


Glu 






315 










320 


He 


Gin 


Arg 


Leu 


Thr 


Ala 


Glu 


Val 




330 










335 




Lys 


Leu 


Glu 


Ala 


Ala 


Val 


Ala 


Gin 


345 










350 






Leu 


Ser 


Asp 


Ala 


Arg 


Cys 


Lys 


Leu 










365 








Lys 


Ala 


Lys 


Gin 


Asp 


Met 


Ala 


Cys 








380 










Met 


Asn 


Ser 


Lys 


Leu 


Gly 


Leu 


Asp 






395 










400 


Leu 


Leu 


Glu 


Gly 


Glu 


Glu 


Gin 


Arg 




410 










415 


Asn 


Val 


Cys 


Val 


Ser 


Ser 


Ser 


Arg 


425 










430 




Cys 


Ala 


Ser 


Thr 


Thr 


Ala 


Pro 


Val 










445 








Pro 


Ser 


Asn 


Ser 


Asn 


Val 


Val 


Val 








460 










Ser 


Ala 


Arg 


Val 


Gly 


Val 


Cys 


Gly 






475 










480 


Gly 


Cys 


Gly 


Phe 


Arg 


Pro 


Gly 


Asn 




10 










15 




Pro 


Arg 


Pro 


Ser 


Arg 


Cys 


Cys 


He 


25 










30 






Ser 


Cys 


Tyr 


Arg 


Gly 


Leu 


Thr 


Gly 










45 








Gly 


Gly 


Phe 


Arg 


Ala 


Gly 


Ser 


Cys 








60 










Gly 


Gly 


Val 


Cys 


Gly 


Pro 


Ser 


Pro 






75 










80 


Asn 


Glu 


Ser 


Leu 


Leu 


Thr 


Pro 


Leu 




90 










95 




Gin 


Cys 


Val 


Lys 


Gin 


Glu 


Glu 


Lys 


105 








110 
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Glu Gin 


lie 


Lys 


Ser 


Leu 


Asn 


Ser 




115 










120 


Val Arg 


Phe 


Leu 


Glu 


Gin 


Gin 


Asn 


130 










135 




Phe Tyr 


Gin 


Asn 


Cvs 

\»y 0 


Glu 


Cys 


Cvs 


145 








150 






Ala Gly 


Tyr 


He 


Glu 


Thr 


Leu 


Arc? 








165 








Asp Ser 


Gly 


Arg 


Leu 


Ala 


Ser 


Glu 






180 










Glu Gly 


Tvr 


Lys 


Lys 


Lvs 


Tvr 

* j -** 


Glu 




195 










200 


Ala Glu 


Asn 


Glu 


Phe 


Val 


Ala 


Leu 


210 

A X W 










215 




Leu Arg 


Lys 


Ser Asp 


Leu 


Glu 


Ala 


9 *> R 
^ o 








2m 






Tie Asn 


Phe 


Leu 


Arg 


Arcr 


Leu 


Tvr 

xyx 








245 








Ser His 


lie 


Ser 


Asp 


Thr 


Ser 


Val 






260 










7\ on T.f=»n 
nop UCU 


7\ en 


Met 


Asp 


Cys 


Tie 

X X G 


Val 

VAX 




77C 

A / 9 










280 

A O \f 


2i Q'n Tip 
nop xxc 


AT a 


Thr Arg 


Car 
OCX 


Axy 


Ala 


2 Qn 

& Z> \J 










^ ZJ 




Lys Cys 


Glu 


Glu 


Met 


Lys 


Ala 


Thr 
















Ara Ara 
ax. y nx y 


Thr 
xhx 


Lys 


Glu 


Glu 


Tie 

X xc 










325 








Leu. Thr 


Ala 


Glu 


Val 


Glu 


Asn 


Ala 






340 










Ala Ala 


Val 

V ctx 


Ala 


Gin 


OCX 


Glu 

wX \X 


Gin 




J 3 -3 












Ala Ayg 


Cys 


Lys 


Leu 


Ala 


Glu 


Leu 












375 




Gin Ann 


Met 


Ala 


Cys 


Leu 


He 


Ay* a 


^ ft c; 

O O J 








390 






Xjyo UCU 


OX jr 


Leu 


Asp 


He 


Glu 


X J.C 








405 








Glv Glu 


Glu 


Gin Arg 


Leu 


Cys 


Glu 

V7X U. 






420 










Val Ser 


Ser 


Ser 


Arg 


Gly Gly 


Val 




435 










440 


Gly Ser 


Arg 


Pro 


Val 


Thr 


Gly 


Ser 


450 










455 




Asn Leu 


Val 


Val 


Ser 


Thr Gly 


Leu 


465 








470 






Thr Thr 


Cys 


Gly Gly 


Gly Ser 


Cys 








485 









<210> 448 



<211> 143 
<212> PRT 
<213> Homo sapiens 



<400> 448 

Met Ala Ser Gin Ser Cys His He 
1 5 
Phe Ser Ser Arg Ser Ala Thr Val 
20 

Val Ser Ala Met Ala His His Gly 
35 40 



-252- 



Arcr 


Phe 


Ala 


Ala 


Phe 


He 


Asd 


Lvs 










125 








Lvs 

xiy 0 


Leu 


Leu 


Glu 


Thr 


Lvs 


Leu 


Gin 








140 










Gin 


Ser 


Asn 


Leu 


Glu 


Pro 


Leu 


Phe 






155 










160 


Ara 


Glu 


Ala 


Glu 


Cvs 

\-*y 0 


Val 


Glu 


Ala 




170 










175 




Leu 


Asn 


His 


Val 


Gin 


Glu 


Val 


Leu 


185 










190 






Glu 


Glu 


Val 


Ala 


Leu 


Ara 


Ala 


Thr 










205 








Lys 


Lvs 

xiy a 


Asp 


Val 


Asp 


Cvs 


Ala 


Tvr 


















Asn 


Val 


Glu 


Ala 


Leu 


He 


Gin 


Glu 






2 "\ ^ 

« j j 












Glu 


Glu 


Glu 


He 


Ara 


He 


Leu 


Gin 




2 

A 3 SJ 














Val 


Val 


Lys 


Leu 


Asp 


Asn 


Ser 


Ara 












2 70 






Ala 


Glu 


He 


Lys 


Ala 


Gin 


Tvr 

Ay x 


Asp 










285 








Glu 


Ala 


Glu 


Ser 




Tvr* 
xyx 


Ara 


Ser 


















Val 


He 


Ara 
«xy 


His 


Glv 

vaxy 


Glu 


Thr 


Leu 






11c 

Ol3 










ion 


Gl 11 


t.oi 1 

11CU 


Zk on 
noil 


/\rg 




Tl e 

X X c 


Gl n 


Arg 




'X'k n 










^ ^ R 

J J J 




Lys 


v-y» 


Gin 


Asn 


Ser 


Lys 


Leu 


Glu 


■5/1 C 










JDU 






Gl ri 


Gl \r 


Gl u 

OX UL 


AT a 


AT a 

nlCl 


UC Li 


OCi. 












J D O 








Glu 


Glv 
vaxy 


Ala 


Leu 


Gin 


Lys 


Ala 


Lvs 
xiy a 








inn 

•SOU 










Glu 


xyx 


Gin 


Glu 


Val 


Met 


Asn 


Ser 
















Ann 


Ala 


Thr 


Tvt* 
xyx 


Ara 


Ara 


Leu 


Leu 


Glu 




*X X \J 














Glv 

vjxy 


Val 


Glu 


Ala 


Val 


Asn 


Val 


Cys 


425 










430 






Val 


Cys 


Gly 


Asp 


Leu 


Cys 


Val 


Ser 










445 








Val 


Cys 


Ser 


Ala 


Pro 


Cys 


Asn 


Gly 








460 










Cys 


Lys 


Pro 


Cys 


Gly 


Gin 


Leu 


Asn 






475 










480 


Gly 


Gin 


Gly 


Arg 


Tyr 










490 














Ser 


Ser 


Gly 


Cys 


Gly 


Val 


Lys 


Asn 




10 










15 




Pro 


Lys 


Pro 


Gly 


Tyr 


His 


Ser 


Cys 


25 










30 






Val 


Ser 


Pro 


Gly 


Gly 


Leu 


Gly 


Ser 



45 
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Arg 


Arg 


Leu 


Gly Gly Phe Gly Ser 




50 




55 


Pro 


Arg 


He 


Ala Val Ser Cys Arg 


65 






70 


Phe 


Gly 


Tyr 


Trp Ala Gly Gly Leu 








85 


Thr 


Ser 


Val 


Thr He Asn Glu Ser 








100 


He 


As P 


Pro 


Asn Ala Gin Cys Val 






115 


120 


Arg 


Cys 


Leu 


Asn Lys Phe Ala Ala 




130 




135 



<210> 449 



<211> 507 
<212> PRT 
<213> Homo sapiens 



-253- 

Gln Ser Leu Cys Thr Val Gly Ser 
60 

Trp Pro Leu His Ser Arg Gly Arg 

75 80 
Cys Arg Pro Ser Pro Pro Arg He 

90 95 
Leu Leu Met Pro Leu Asn Leu Glu 
105 110 
Lys His Glu Glu Lys Glu His He 
125 

Phe He Asp Lys Val Gly Leu 
140 



<400> 449 

Met Ser Cys Arg Ser Tyr Arg He Ser Ser Gly Cys Gly Val Thr Arg 

1 5 10 15 

Asn Phe Ser Ser Cys Ser Ala Val Ala Pro Lys Thr Gly Asn Arg Cys 

20 25 * 30 

Cys He Ser Ala Ala Pro Tyr Arg Gly Val Ser Cys Tyr Arg Gly Leu 

35 40 45 

Thr Gly Phe Gly Ser Arg Ser Leu Cys Asn Leu Gly Ser Cys Gly Pro 

50 55 60 

Arg He Ala Val Gly Gly Phe Arg Ala Gly Ser Cys Gly Arg Ser Phe 
65 70 75 80 

Gly Tyr Arg Ser Gly Gly Val Cys Gly Pro Ser Pro Pro Cys He Thr 

85 90 95 

Thr Val Ser Val Asn Glu Ser Leu Leu Thr Pro Leu Asn Leu Glu He 

100 105 no 

Asp Pro Asn Ala Gin Cys Val Lys Gin Glu Glu Lys Glu Gin He Lys 

115 120 125 

Ser Leu Asn Ser Arg Phe Ala Ala Phe He Asp Lys Val Arg Phe Leu 

130 135 140 

Glu Gin Gin Asn Lys Leu Leu Glu Thr Lys Trp Gin Phe Tyr Gin Asn 
145 150 155 160 

Gin Arg Cys Cys Glu Ser Asn Leu Glu Pro Leu Phe Ser Gly Tyr He 

165 170 175 

Glu Thr Leu Arg Arg Glu Ala Glu Cys Val Glu Ala Asp Ser Gly Arg 

180 185 190 

Leu Ala Ser Glu Leu Asn His Val Gin Glu Val Leu Glu Gly Tyr Lys 

195 200 205 

Lys Lys Tyr Glu Glu Glu Val Ala Leu Arg Ala Thr Ala Glu Asn Glu 

210 215 220 

Phe Val Val Leu Lys Lys Asp Val Asp Cys Ala Tyr Leu Arg Lys Ser 
225 230 235 240 

Asp Leu Glu Ala Asn Val Glu Ala Leu Val Glu Glu Ser Ser Phe Leu 

245 250 255 

Arg Arg Leu Tyr Glu Glu Glu He Arg Val Leu Gin Ala His He Ser 

260 265 270 

Asp Thr Ser Val He Val Lys Met Asp Asn Ser Arg Asp Leu Asn Met 

275 280 285 

Asp Cys He He Ala Glu He Lys Ala Gin Tyr Asp Asp Val Ala Ser 

290 295 300 

Arg Ser Arg Ala Glu Ala Glu Ser Trp Tyr Arg Ser Lys Cys Glu Glu 
305 310 315 320 

Met Lys Ala Thr Val He Arg His Gly Glu Thr Leu Arg Arg Thr Lys 
325 330 ^ 335 
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Glu 


Glu 


He 


Asn 


Glu 


Leu 


Asn 


Ara 








340 










lie 


Glu 


Asn 


Ala 


Lvs 


Cvs 

^ Jr •** 


Gin 


Aro 






355 










360 


Glu 


Ala 


Glu 


Gin 


Gin 


Gly 


Glu 


Ala 




370 










375 




Leu 


Ala 


Glu 


Leu 


Glu 


Gly 


Ala 


Leu 


385 










390 






Cys 


Leu 


Leu 


Lvs 


Glu 


Tvr 

JT 


Gin 


Glu 










405 








Asp 


He 


Glu 


He 


Ala 


Thr 


Tvr 

A Jr *■ 


Aro 








420 












Leu 


Pvs 
v-y & 


Glu 


Glv 


Val 


Glv 


Ser 






*X O 3 










tit w 


Arg 


Gly 


Gly 


Val 


Ser 


Cys 


Gly 


Gly 




450 










455 




Arg 


Gin 


He 


Thr 


Ser 


Gly 


Pro 


Ser 


465 










470 






Val 


Ala 


Pro 


Asp 


Ser 


Cys 


Ala 


Pro 










485 








Ser 


Cys 


Gly 


Ser 


Ser 


Arg 


Ser 


Val 








500 










<210> 450 












<211> 600 












<212> PRT 












<213> Homo 


sapiens 








<400> 450 












Met 


Ser 


Cys 


Arg 


Ser 






v ai 


1 








5 








Phe 


Ser 


Ser 


Cys 


Ser 


Ala 


Met 


Thr 








20 










Ala 


Asn 


Ser 


Val 


Ser 






OC1 






35 










4U 


Ser 


Phe 


Gly 


Ser 


Arg 


Ser 


val 


j. ±e 




50 










DO 




He 


Ala 


Ala 


Val 


Gly 




Arg 


rr X. U 


65 










/ U 






Ala 


Gly 


Cys 


Gly 


Met 


<j±y 


rue 


V3X y 










85 








Pro 


Arg 


Ala 


Asp 


Ser 


Cys 


Val 


Glv 








100 










He 


Gly 


Tyr 


Gly 


Phe 


Glv 


Glv 


Pro 






115 










1 ? 0 

J. *S \J 


Val 


Gly 


Val 


Pro 


Ala 


Ala 


Pro 

XT -L. w 


Ser 




130 














Ser 


Leu 


Leu 


Thr 


Pro 


T.pu 


7\ c? n 
noil 


LIC U. 


145 










■Ljv 






Val 


Lys 


Lys 


Asp 


Glu 


Lys 


Glu 


Gin 










165 








Ala 


Ser 


Phe 


He 


Asp 


Lys 


Val 


Arg 








180 










Leu 


Glu 


Thr 


Lys 


Trp 


Ser 


Phe 


Leu 






195 










200 


Asn 


Leu 


Glu 


Pro 


Leu 


Phe 


Glu 


Ser 




210 










215 




Leu 


Glu 


Val 


Leu 


Val 


Ser 


Asp 


Gin 


225 










230 






His 


Leu 


Gin 


Asp 


Val 


Leu 


Glu 


Gly 



245 



-254- 



Met 


He 


Gin 


Arcr 


Leu 


Thr 


Ala 


Glu 


345 










350 






Ala 


Lys 


Leu 


Glu 


Ala 


Ala 


Val 


Ala 








365 








Ala 


Leu 


Ser 


Asn 


Ala 


Ara 

***** 3 


Cvs 


Lvs 








380 










Gin 


Lvs 


Ala 


Lvs 


Gin 


ASP 


Met 


Ala 






395 










400 


Val 


Met 


Asn 


Ser 


Lvs 


Leu 


Gly 
******* 


Leu 




410 










415 




Arcr 


Leu 


Leu 


Glu 


Glv 


Glu 


Glu 


His 


425 










430 






Val 


Asn 


Val 


Cvs 

w jr » 


Val 


Ser 


Ser 


Ser 










445 








Leu 


Ser 


Tyr 


Ser 


Thr 


Thr 


Pro 


Gly 








460 










Ala 


He 


Gly 


Gly 


Ser 


He 


Thr 


Val 






475 










480 


Cys 


Gin 


Pro 


Arg 


Ser 


Ser 


Ser 


Phe 




490 










495 




Arg 


Phe 


Ala 












505 
















Ser 


Ser 


Glv 


His 


Arg 


Val 


Glv 


Asn 




10 










15 




Pro 


Gin 


Asn 


Leu 


Asn 




Phe 


Ara 




















Pro 


Glv 


Phe 


Arg 


Glv 


Leu 


Glv 










45 














Ser Tyr 


OCX. 


XT X *J 










60 










He 


His 


Cys 


Gly Val 




Phe 


Glv 






/ D 












Asp 


Glv 


Ara 


Gly Val 


Glv 


Leu 


Glv 




90 










95 




Leu 


Glv 


Phe 


Gly Ala 


Glv 


Ser 


Glv 

jr 


JL VJ _} 










HO 

J. X V 






Glv 

vj-i- y 


Phe 


Glv 


Tyr 


Arg 


Val 


Glv 


Glv 
WJ - y 










125 








He 


Thr 


Ala 


Val 


Thr 


Val 


Asn 


Lys 








140 










Glu 


He 


Asp 


Pro 


Asn 


Ala 


Gin 


Ara 






155 










160 


He 


Lvs 


Thr 


Leu 


Asn 


Asn 


Lvs 


Phe 




170 










175 




Phe 


Leu 


Glu 


Gin 


Gin 


Asn 


Lys 


Leu 


185 










190 






Gin 


Glu 


Gin 


Lys 


Cys 


He 


Arg 


Ser 










205 








Tyr 


He 


Thr 


Asn 


Leu 


Arg 


Arg 


Gin 








220 










Ala 


Arg 


Leu 


Gin 


Ala 


Glu 


Arg 


Asn 






235 










240 


Phe 


Lys 


Lys 


Lys 


Tyr 


Glu 


Glu 


Glu 




250 










255 
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Val Val Cys Arg Ala Asn Ala Glu Asn Glu Phe Val Ala Leu Lys Lys 

260 265 270 

Asp Val Asp Ala Ala Phe Met Asn Lys Ser Asp Leu Glu Ala Asn Val 

275 280 285 

Asp Thr Leu Thr Gin Glu He Asp Phe Leu Lys Thr Leu Tyr Met Glu 

290 295 300 

Glu He Gin Leu Leu Gin Ser His lie Ser Glu Thr Ser Val He Val 
305 310 315 320 

Lys Met Asp Asn Ser Arg Asp Leu Asn Leu Asp Gly He He Ala Glu 

325 330 335 

Val Lys Ala Gin Tyr Glu Glu Val Ala Arg Arg Ser Arg Ala Asp Ala 

340 345 ~ 350 

Glu Ala Trp Tyr Gin Thr Lys Tyr Glu Glu Met Gin Val Thr Ala Gly 

355 360 365 

Gin His Cys Asp Asn Leu Arg Asn He Arg Asn Glu He Asn Glu Leu 

370 375 380 

Thr Arg Leu He Gin Arg Leu Lys Ala Glu He Glu His Ala Lys Ala 
385 390 395 400 

Gin Arg Ala Lys Leu Glu Ala Ala Val Ala Glu Ala Glu Gin Gin Gly 

405 410 415 

Glu Ala Thr Leu Ser Asp Ala Lys Cys Lys Leu Ala Asp Leu Glu Cys 

420 425 430 

Ala Leu Gin Gin Ala Lys Gin Asp Met Ala Arg Gin Leu Cys Glu Tyr 

435 440 ~ 445 

Gin Glu Leu Met Asn Ala Lys Leu Gly Leu Asp He Glu He Ala Thr 

450 455 460 

Tyr Arg Arg Leu Leu Glu Gly Glu Glu Ser Arg Leu Cys Glu Gly Val 
465 470 475 ~ 480 

Gly Pro Val Asn He Ser Val Ser Ser Ser Arg Gly Gly Leu Val Cys 

485 490 495 

Gly Pro Glu Pro Leu Val Ala Gly Ser Thr Leu Ser Arg Gly Gly Val 

500 505 510 

Thr Phe Ser Gly Ser Ser Ser Val Cys Ala Thr Ser Gly Val Leu Ala 

515 520 525 

Ser Cys Gly Pro Ser Leu Gly Gly Ala Arg Val Ala Pro Ala Thr Gly 

530 535 540 

Asp Leu Leu Ser Thr Gly Thr Arg Ser Gly Ser Met Leu He Ser Glu 
545 550 555 560 

Ala Cys Val Pro Ser Val Pro Cys Pro Leu Pro Thr Gin Gly Gly Phe 

565 570 575 

Ser Ser Cys Ser Gly Gly Arg Ser Ser Ser Val Arg Phe Val Ser Thr 

580 585 590 

Thr Thr Ser Cys Arg Thr Lys Tyr 
595 ^ 600 

<210> 451 

<211> 513 

<212> PRT 

<213> Homo sapiens 



<400> 451 

Met Ser Tyr His Ser Phe Gin Pro 
1 5 
Phe Ser Ser Tyr Ser Ala Val Met 
20 

Val Ser Lys Gly Pro Cys Arg Pro 

35 40 
Leu Gly Cys Leu Gly Ser Arg Ser 

50 55 
Pro Arg Val Ala Ser Arg Cys Gly 
65 70 



Gly Ser Arg Cys Gly Ser Gin Ser 

10 15 
Pro Arg Met Val Thr His Tyr Ala 
25 3 0 

Gly Gly Gly Arg Gly Leu Arg Ala 
45 

Leu Cys Asn Val Gly Phe Gly Arg 
60 

Gly Thr Leu Pro Gly Phe Gly Tyr 
75 80 
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Arg Leu Gly Ala Thr Cys Gly Pro Ser Ala Cys lie Thr Pro Val Thr 

85 90 95 

lie Asn Glu Ser Leu Leu Val Pro Leu Ala Leu Glu lie Asp Pro Thr 

100 105 110 

Val Gin Arg Val Lys Arg Asp Glu Lys Glu Gin lie Lys Cys Leu Asn 

115 120 125 

Asn Arg Phe Ala Ser Phe lie Asn Lys Val Arg Phe Leu Glu Gin Lys 

130 135 140 

Asn Lys Leu Leu Glu Thr Lys Trp Asn Phe Met Gin Gin Gin Arg Cys 
145 150 155 160 

Cys Gin Thr Asn He Glu Pro He Phe Glu Gly Tyr He Ser Ala Leu 

165 170 175 

Arg Arg Gin Leu Asp Cys Val Ser Gly Asp Arg Val Arg Leu Glu Ser 

180 185 190 

Glu Leu Cys Ser Leu Gin Ala Ala Leu Glu Gly Tyr Lys Lys Lys Tyr 

195 200 2 05 

Glu Glu Glu Leu Ser Leu Arg Pro Cys Val Glu Asn Glu Phe Val Ala 

210 215 220 

Leu Lys Lys Asp Val Asp Thr Ala Phe Leu Met Lys Ala Asp Leu Glu 
225 230 235 240 

Thr Asn Ala Glu Ala Leu Val Gin Glu He Asp Phe Leu Lys Ser Leu 

245 250 255 

Tyr Glu Glu Glu He Cys Leu Leu Gin Ser Gin He Ser Glu Thr Ser 

260 265 270 

Val He Val Lys Met Asp Asn Ser Arg Glu Leu Asp Val Asp Gly He 

275 280 285 

lie Ala Glu He Lys Ala Gin Tyr Asp Asp He Ala Ser. Arg Ser Lys 

290 295 300 

Ala Glu Ala Glu Ala Trp Tyr Gin Cys Arg Tyr Glu Glu Leu Arg Val 
305 310 315 320 

Thr Ala Gly Asn His Cys Asp Asn Leu Arg Asn Arg Lys Asn Glu He 

325 330 335 

Leu Glu Met Asn Lys Leu He Gin Arg Leu Gin Gin Glu Thr Glu Asn 

340 345 350 

Val Lys Ala Gin Arg Cys Lys Leu Glu Gly Ala He Ala Glu Ala Glu 

355 360 365 

Gin Gin Gly Glu Ala Ala Leu Asn Asp Ala Lys Cys Lys Leu Ala Gly 

370 375 380 

Leu Glu Glu Ala Leu Gin Lys Ala Lys Gin Asp Met Ala Cys Leu Leu 
385 390 395 400 

Lys Glu Tyr Gin Glu Val Met Asn Ser Lys Leu Gly Leu Asp He Glu 

405 410 415 

He Ala Thr Tyr Arg Arg Leu Leu Glu Gly Glu Glu His Arg Leu Cys 

420 425 430 

Glu Gly He Gly Pro Val Asn He Ser Val Ser Ser Ser Lys Gly Ala 

435 440 445 

Phe Leu Tyr Glu Pro Cys Gly Val Ser Thr Pro Val Leu Ser Thr Gly 

450 455 460 

Val Leu Arg Ser Asn Gly Gly Cys Ser He Val Gly Thr Gly Glu Leu 
465 470 475 480 

Tyr Val Pro Cys Glu Pro Gin Gly Leu Leu Ser Cys Gly Ser Gly Arg 

485 490 495 

Lys Ser Ser Met Thr Leu Gly Ala Gly Gly Ser Ser Pro Ser His Lys 
500 505 510 



His 




<210> 


452 


<211> 


85 


<212> 


PRT 



<213> Homo sapiens 
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<400> 452 

Met Asp Ala Val Tyr Met Asn Lys Val Gly Leu Glu Ala Lys Val Asd 

\ 5 io 15 

Ala Leu Met Glu Glu Thr Asn Phe Leu Ser Thr Phe Tyr Lys Ala Val 

20 25 30 

Arg Val Pro Gly Ala Pro Ser Asn Arg Gly Ala Gly Gly Trp Val Leu 

35 40 45 

Glu Pro Gin Leu Gly Thr Glu Pro Val Gly Ser Phe Pro Gly Leu Leu 

50 55 60 

Ser Ala Pro Tyr Pro Thr Cys Val Leu Gin Gly Arg Cys His Phe Pro 
65 70 75 80 

Tyr His Arg Arg Lys 
85 

<210> 453 
<211> 564 
<212> PRT 
<213> Homo sapiens 



<400> 453 

Met Ala Ser Thr Ser Thr Thr lie Arg Ser His Ser Ser Ser Arcr Arq 

15 io 15 

Gly Phe Ser Ala Ser Ser Ala Arg Leu Pro Gly Val Ser Arg Ser Gly 

20 25 30 

Phe Ser Ser lie Ser Val Ser Arg Ser Arg Gly Ser Gly Gly Leu Gly 

35 40 45 

Gly Ala Cys Gly Gly Ala Gly Phe Gly Ser Arg Ser Leu Tyr Gly Leu 

50 55 60 

Gly Gly Ser Lys Arg He Ser He Gly Gly Gly Ser Cys Ala He Ser 
65 70 75 80 

Gly Gly Tyr Gly Ser Arg Ala Gly Gly Ser Tyr Gly Phe Gly Gly Ala 

85 90 95 

Gly Ser Gly Phe Gly Phe Gly Gly Gly Ala Gly He Gly Phe Gly Leu 

100 105 no 

Gly Gly Gly Ala Gly Leu Ala Gly Gly Phe Gly Gly Pro Gly Phe Pro 

115 120 125 

Val Cys Pro Pro Gly Gly He Gin Glu Val Thr Val Asn Gin Ser Leu 

130 135 140 

Leu Thr Pro Leu Asn Leu Gin He Asp Pro Ala He Gin Arg Val Arg 
I 45 150 155 160 

Ala Glu Glu Arg Glu Gin He Lys Thr Leu Asn Asn Lys Phe Ala Ser 

165 170 175 

Phe He Asp Lys Val Arg Phe Leu Glu Gin Gin Asn Lys Val Leu Asp 

180 185 190 

Thr Lys Trp Thr Leu Leu Gin Glu Gin Gly Thr Lys Thr Val Arg Gin 

195 200 205 

Asn Leu Glu Pro Leu Phe Glu Gin Tyr He Asn Asn Leu Arg Arg Gin 

210 215 220 

Leu Asp Ser He Val Gly Glu Arg Gly Arg Leu Asp Ser Glu Leu Arg 
225 230 235 240 

Asn Met Gin Asp Leu Val Glu Asp Leu Lys Asn Lys Tyr Glu Asp Glu 

245 250 * 255 

He Asn Lys Arg Thr Ala Ala Glu Asn Glu Phe Val Thr Leu Lys Lys 

260 265 270 

Asp Val Asp Ala Ala Tyr Met Asn Lys Val Glu Leu Gin Ala Lys Ala 

275 280 285 

Asp Thr Leu Thr Asp Glu He Asn Phe Leu Arg Ala Leu Tyr Asp Ala 

290 295 300 

Glu Leu Ser Gin Met Gin Thr His He Ser Asp Thr Ser Val Val Leu 
305 310 315 320 

Ser Met Asp Asn Asn Arg Asn Leu Asp Leu Asp Ser He He Ala Glu 
325 330 335 
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Val 


Lys 


Ala 


Gin 


Tyr Glu Glu He 








340 




Glu 


Ser 


Trp 


Tyr 


Gin Thr Lys Tyr 






355 




360 


Arg 


His 


Gly 


Asp 


Asp Leu Arg Asn 




370 






375 


Asn 


Arg 


Met 


lie 


Gin Arg Leu Arg 


385 








390 


Gin 


Cys 


Ala 


Asn 


Leu Gin Ala Ala 










405 


Glu 


Met 


Ala 


Leu 


Lys Asp Ala Lys 








420 




Ala 


Leu 


Gin 


Lys 


Ala Lys Gin Asp 






435 




440 


Gin 


Glu 


Leu 


Met 


Asn Val Lys Leu 




450 






455 


Tyr 


Arg 


Lys 


Leu 


Leu Glu Gly Glu 


465 








470 


Val 


Gly 


Gin 


Val 


Asn He Ser Val 










485 


Tyr 


Gly 


Gly 


Ala 


Ser Gly Val Gly 








500 




Ser 


Ser 


Tyr 


Ser 


Tyr Gly Ser Gly 






515 




520 


Ser 


Ser 


Ser 


Gly 


Arg Ala Thr Gly 




530 






535 


Gly 


Ser 


Ser 


Thr 


He Lys Tyr Thr 


545 








550 


Ser 


Tyr 


Lys 


His 





<210> 454 



<211> 564 
<212> PRT 
<213> Homo sapiens 



<400> 454 






Met 


Ala 


Ser 


Thr 


Ser Thr Thr He 


1 








5 


Gly 


Phe 


Ser 


Ala 


Asn Ser Ala Arg 








20 




Phe 


Ser 


Ser 


He 


Ser Val Ser Arg 






35 




40 


Gly 


Ala 


Cys 


Gly 


Glv Ala Glv Phe 




50 






55" 


Gly 


Gly 


Ser 


Lys 


Arg He Ser He 


65 








70 


Gly 


Gly 


Tyr 


Gly 


Ser Arg Ala Arg 










85 


Gly 


Ser 


Gly 


Phe 


Gly Phe Gly Gly 








100 




Gly 


Gly 


Gly 


Ala 


Gly Leu Ala Gly 






115 




120 


Val 


Cys 


Pro 


Pro 


Gly Gly He Gin 




130 






135 


Leu 


Thr 


Pro 


Leu 


Asn Leu Gin He 


145 








150 


Ala 


Glu 


Glu 


Arg 


Glu Gin He Lys 










165 


Phe 


He 


Asp 


Lys 


Val Arg Phe Leu 








180 
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Ala 


Gin Arg 


Ser 


Arg 


Ala 


Glu 


Ala 


345 








350 






Glu 


Glu Leu 


Gin 


Val 


Thr 


Ala 


Gly 








365 








Thr 


Lys Gin 


Glu 


He 


Ala 


Glu 


He 




380 










Ser 


Glu He 


Asp 


His 


Val 


Lys 


Lys 




395 










400 


He 


Ala Asp 


Ala 


Glu 


Gin 


Arg 


Gly 




410 








415 




Asn 


Lys Leu 


Glu 


Gly 


Leu 


Glu 


Asp 


425 








430 






Leu 


Ala Arg 


Leu 


Leu 


Lys 


Glu 


Tyr 








445 








Ala 


Leu Asp 


Val 


Glu 


He 


Ala 


Thr 






460 










Glu 


Cys Arg 


Leu 


Asn 


Gly 


Glu 


Gly 




475 










480 


Val 


Gin Ser 


Thr 


Val 


Ser 


Ser 


Gly 




490 








495 




Ser 


Gly Leu 


Gly 


Leu 


Gly 


Gly 


Gly 


505 








510 






Leu 


Gly Val 


Gly Gly 


Gly 


Phe 


Ser 








525 








Gly 


Gly Leu 


Ser 


Ser 


Val 


Gly 


Gly 






540 










Thr 


Thr Ser 


Ser 


Ser 


Ser 


Arg 


Lys 




555 










560 


Arg 


Ser His 


Ser 


Ser 


Ser 


Arg 


Arg 




10 








15 




Leu 


Pro Gly 


Val 


Ser 


Arg 


Ser 


Gly 


25 








30 






Ser 


Arg Gly 


Ser Gly 


Gly 


Leu 


Gly 








45 








Gly 


Ser Arg 


Ser 


Leu 


Tyr 


Gly 


Leu 






60 










Gly 


Gly Gly 


Ser 


Cys 


Ala 


He 


Ser 




75 










80 


Gly 


Ser Tyr 


Gly 


Phe 


Gly 


Gly 


Ala 




90 








95 




Gly 


Ala Gly 


He Gly 


Phe 


Asp 


Leu 


105 








110 






Gly 


Phe Gly 


Gly Pro 


Gly 


Phe 


Pro 








125 








Glu 


Val Thr 


Val 


Asn 


Gin 


Ser 


Leu 






140 










Asp 


Pro Ala 


He 


Gin 


Arg 


Val 


Arg 




155 










160 


Thr 


Leu Asn 


Asn Lys 


Phe 


Ala 


Ser 




170 








175 




Glu 


Gin Gin 


Asn 


Lys 


Val 


Leu 


Asp 


185 








190 
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Thr 


Lys 


Trp 


Thr 


Leu Leu Gin 


Glu 


Gin 


Gly Thr Lys 


Thr 


Val 


Arg 


Gin 






195 






200 




205 






Asn 


Leu 


Glu 


Pro 


Leu Phe Glu 


Gin 


Tyr 


He Asn Asn 


Leu 


Arq 


Arg 


Gin 




210 






215 






220 






Leu 


Asp 


Asn 


He 


Val Gly Glu Arg 


Gly Arg Leu Asp 


Ser 


Glu 


Leu 


Arer 


225 








230 






z o o 








240 


Asn 


Met 


Gin 


Asp 


Leu Val Glu Asp 


T .on 


juys Asn Jjys 


Tvr 


Glu 


Asp 


Glu 






















255 




lie 


Asn 


Lys 


Arg 




r*"\ «<• 
ulU 


Asn 


\3±xx rne vai 


Thr 


Leu 


Lys 


Lvs 








260 






ZOO 






270 


Asp 


Val 


Asp 


Ala 


Ala Tyr Met 


Asn 


jjys 


vai Glu Leu 


Gin 


Ala 


Lys 


Ala 






275 






280 






285 






Asp 


Thr 


Leu 


Thr 


Asp Glu He 


Asn 


rile 


Jjcu Arg A±a 


Leu 


Tyr Asp 


Ala 




290 






295 
















Glu 


Leu 


Ser 


Gin 


Met Gin Thr 


His 


lie 


Ser Asp Thr 


Ser 


Val 


Val 


Leu 


305 








310 






315 








•J £* \J 


Ser 


Met 


Asp 


Asn 


Asn Arg Asn 


Leu 


ASp 


Leu Asp Ser 


He 


He 


Ala 


Glu 










325 






330 






335 




Val 


Lys 


Ala 


Gin 


Tyr Glu Glu 


He 


Ala 


Gin Arg Ser 




Ala 


Glu 


Ala 








340 






345 




350 






Glu 


Ser 


Trp 


Tyr 


Gin Thr Lys 


Tyr 


Glu 


Glu Leu Gin 


Val 


Thr 


Ala 


Gly 






355 






360 






365 






Arg 


His 


Gly 


Asp 


Asp Leu Arg 


Asn 


Thr 


Lys Gin Glu 


He 


Ala 


Glu 


He 




370 






375 






380 










Asn 


Arg 


Met 


He 


Gin Arg Leu Arg 


Ser 


Glu He Asp 


His 


Val 


Lys 




385 








390 






395 






t\J\J 


Gin 


Cys 


Ala 


Ser 


Leu Gin Ala 


Ala 


He 


Ala Asp Ala 


Glu 


Gin 


Arg 


Glv 










405 






410 






415 


Glu 


Met 


Ala 


Leu 


Lys Asp Ala Lys 


Asn 


Lys Leu Glu Gly 


Leu 


Glu 


Asp 








420 






425 






430 




Ala 


Leu 


Gin 


Lys 


Ala Lys Gin 


Asp 


Leu Ala Arg Leu 


Leu 


Lys 


Glu 


Tyr 






435 






440 






445 






Gin 


Glu 


Leu 


Met 


Asn Val Lys 


Leu 


Ala 


Leu Asp Val 


Glu 


He 


Ala 


Thr 




450 






455 






460 










Tyr 


Arg 


Lys 


Leu 


Leu Glu Gly Glu 


Glu 


Cys Arg Leu Asn 


Gly Glu 


Gly 


465 








470 






475 








480 


He 


Gly 


Gin 


Val 


Asn Val Ser 


val 


Val 


Gin Ser Thr 


He 


Ser 


Ser 


Gly 










485 






490 






495 


Tyr 


Gly 


Gly 


Ala 


Ser Gly Val 


Gly 


Ser 


Gly Leu Gly Leu 


Gly 


Gly Gly 








500 






505 






510 






Ser 


Ser 


Tyr 


Ser 


Tyr Gly Ser 


Gly 


Leu Gly He Gly Gly 


Gly 


Phe 


Ser 






515 






520 






525 








Ser 


Ser 


Ser 


Gly 


Arg Ala He 


Gly 


Gly Gly Leu Ser 


Ser 


Val 


Gly Gly 




530 






535 






540 










Gly 


Ser 


Ser 


Thr 


He Lys Tyr 


Thr 


Thr 


Thr Ser Ser 


Ser 


Ser 


Arg 


Lys 


545 








550 






555 






560 


Ser 


Tyr 


Lys 


His 


















<210> 455 




















<211> 564 




















<212> PRT 




















<213> Homo 


sapiens 

















<400> 455 

Met Ala Ser Thr Ser Thr Thr He 
1 5 
Gly Phe Ser Ala Asn Ser Ala Arg 
20 

Phe Ser Ser He Ser Val Ser Arg 
35 40 



Arg Ser His Ser Ser Ser Arg Arg 

10 15 
Leu Pro Gly Val Ser Arg Ser Gly 
25 * 30 

Ser Arg Gly Ser Gly Gly Leu Gly 
45 
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Gly 


Ala 


Cys 


Gly Gly 


Ala Gly Phe Gly 


Ser 


Arg 


Ser 


Leu 


Tyr 


Gly 


Leu 




50 








55 






60 










Glv 


Gly 


Ser 


Lys 


Arg 


He Ser He Gly 


Gly 


Gly 


Ser 


Cys 


Ala 


He 


Ser 


65 










70 




75 










80 


Glv 


Gly 


Tvr 


Gly 


Ser 


Arg Ala Arg Ala 


Ser 


Tvr 


Gly 


Phe 


Gly 


Gly 


Ala 










85 




90 










95 




Glv 


Ser 


Gly 


Phe Gly 


Phe Gly Gly Gly 


Ala 


Gly 


He 


Gly 


Phe 


Asp 


Leu 








100 




105 










110 






GlV 


Gly 


Glv 


Ala Gly 


Leu Ala Gly Gly 


Phe 


GlV 


Gly 


Pro 


Gly 

^2 


Phe 


Pro 






115 






120 








125 








Val 


Cvs 


Pro 


Pro Gly 


Gly He Gin Glu 


Val 


Thr 


Val 


Asn 


Gin 


Ser 


Leu 




130 








135 






140 










Leu 


Thr 


Pro 


Leu 


Asn 


Leu Gin He Asd 


Pro 


Ala 


He 


Gin 


Arcr 


Val 


Arcr 


145 










150 




155 










160 


Ala 


Glu 


Glu 


Arg 


Glu 


Gin He Lys Thr 


Leu 


Asn 


Asn 


Lvs 


Phe 


Ala 


Ser 










165 




170 










175 




Phe 


He 


Asp 


Lys 


Val 


Arg Phe Leu Glu 


Gin 


Gin 


Asn 


Lvs 


Val 


Leu 


Glu 








180 




185 










190 






Thx 


Lys 


XXJJ 


Thr 


Leu 


Leu Gin Glu Gin 


Glv 


Thr 


Lys 


Thr 


Val 


Arcr 


Gin 






Xv J 






200 








205 








Asn 


Leu 


Glu 


Pro 


Leu 


Phe Glu Gin Tvr 


He 


Asn 


Asn 


Leu 


Arcr 


Arcr 


Gin 




oin 








915 






220 










Leu 


Asp 


Ser 


He 


Val 


dl \r CZ~\ n Ara Glv 

VJ^jr \JX u fix >J V? J. JT 


Ara 


Leu 


Asp 


Ser 


Glu 


Leu 


Arcr 
















935 
a j 












vjrx. jr 


1*1 e l. 


m n 
ox. it 


Asp 


Leu 


\7s> 1 ml ii 31 en Php 

V CL -L V7X. U nop XT IXC? 


xj y 9 




T,VQ 

xjjr 9 


Tvr 


Glu 




Gin 

w J- IX 










245 




5 50 

U \J 










9 55 

<C- Z) 




Tie 

J. x c 


Asn 


U jr O 


Arg 


Thr 


JlI a- Ala Glu Asn 

X%XCl /TLX CI VJXU X"XE> AJL 


Glu 


Phe 


Val 


Thr 


Leu 


Lys 


Lys 








260 




5 65 

AOS 
















Asp 


Val 




Ala 


Ala 


Tvr Mel" Ann T,vr 


Val 


Glu 


Leu 


Gin 


Ala 


Lys 


Ala 






275 






280 








2 85 








Asp 


Thr 


Leu 


Thr Asp 


Glu Tie Asn Phe 


Leu 


Arcr 


Ala 


Leu 


Tvr 


Asp 


Ala 




9 QO 

j \j 














300 

O \J V 










Glu 


Leu 


Ser 


Gin 


Met 


Gin Thr His He 


Ser 


Asp 


Thr 


Ser 


Val 


Val 


Leu 


J U J 










J Xv 




315 

J JL J 










320 


Ser 


Met: 


Asp 


Asn 


Asn 


Arcr Asn Leu Asd 


Leu 


Asp 


Ser 


He 


He 


Ala 


Glu 










325 














3 3 5 

•J J 3 




V Cl. JL 




Al a 


Gin 


Tyr 


Glu G*!u Tie Ala 

VJ1U UJ.U X1C AJLCI 


Gin 


Arcr 


OCX. 




Ala 


Gin 

VJX. IX 


Ala 








340 




3 4 5 










3 50 
o ~j \J 






Glu 


Ser 




Tyr 


Gin 


TKt* Lvs Tvr Glu 


Glu 


Leu 


Gin 


Val 


Thr 


Ala 


Glv 






O -3 -> 






J u U 








3 65 








Ara 


His 


Glv 


Asp 


Asp 


Leu Aro Asn Thr 


Lys 


Gin 


Glu 


He 


Ala 


Glu 


He 




370 














380 










As xi 


Aircr 


Met 


He 


Gin 


At*ct Lieu Aro Ser 


Glu 


He 


Asp 


His 


Val 


Lys 


Lys 


385 










390 




395 










400 


Gin 


Cvs 


Ala 


Asn 


Leu 


Gin Ala Ala He 


Ala 


Asn 


Ala 


Glu 


Gin 


Ara 


Gly 










405 




410 










415 




Glu 


Met 


Ala 


Leu Lys 


A cjt-> a! a Lvs Asn 

rto £f OX CX XJjr 9 noil 


Lys 


Leu 


Glu 


Glv 


Leu 


Glu 


Asp 








420 




4.95 

"S ^ J 










430 






Ala 


Leu 


Gin 


Lys 


Ala 


Lvs Gin Asto Leu 


Ala 


Arcr 


Leu 


Leu 


Lvs 


Glu 


Tvr 

x jr x. 












440 








445 








Gin 


Glu 


Leu 


Met 


Asn 


Val Lys Leu Ala 


Leu 


Asp 


Val 


Glu 


He 


Ala 


Thr 




450 








455 






460 










Tvr 


Ara 


Lvs 


Leu 


Leu 


Glu Gly Glu Glu 


Cvs 

*~2 *"* 


Arq 


Leu 


Asn 


Gly 


Glu 


Gly 


465 










470 




475 










480 


Val 


Gly 


Gin 


Val 


Asn 


He Ser Val Val 


Gin 


Ser 


Thr 


Val 


Ser 


Ser 


Gly 










485 




490 










495 




Tyr 


Gly 


Gly 


Ala 


Ser 


Gly Val Gly Ser 


Gly 


Leu 


Gly 


Leu 


Gly 


Gly 


Gly 








500 




505 










510 






Ser 


Ser 


Tyr 


Ser 


Tyr 


Gly Ser Gly Leu 


Gly 


Val 


Gly 


Gly 


Gly 


Phe 


Ser 






515 






520 








525 








Ser 


Ser 


Ser 


Gly Arg 


Ala He Gly Gly 


Gly 


Leu 


Ser 


Ser 


Val 


Gly 


Gly 




530 








535 






540 










Gly 


Ser 


Ser 


Thr 


He 


Lys Tyr Thr Thr 


Thr 


Ser 


Ser 


Ser 


Ser 


Arg 


Lys 


545 










550 




555 










560 


Ser 


Tyr 


Lys 


His 
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<210> 456 
<211> 564 
<212> PRT 
<213> Homo sapiens 



<400> 456 

Met Ala Ser Thr Ser Thr Thr lie Arg Ser His Ser Ser Ser Arg Arg 

1 5 io 15 

Gly Phe Ser Ala Asn Ser Ala Arg Leu Pro Gly Val Ser Arg Ser Gly 

20 25 30 

Phe Ser Ser Val Ser Val Ser Arg Ser Arg Gly Ser Gly Gly Leu Gly 

35 40 45 

Gly Ala Cys Gly Gly Ala Gly Phe Gly Ser Arg Ser Leu Tyr Gly Leu 

50 55 60 

Gly Gly Ser Lys Arg lie Ser lie Gly Gly Gly Ser Cys Ala lie Ser 
65 70 75 80 

Gly Gly Tyr Gly Ser Arg Ala Gly Gly Ser Tyr Gly Phe Gly Gly Ala 

85 90 95 

Gly Ser Gly Phe Gly Phe Gly Gly Gly Ala Gly He Gly Phe Gly Leu 

100 105 no 

Gly Gly Gly Ala Gly Leu Ala Gly Gly Phe Gly Gly Pro Gly Phe Pro 

115 120 125 

Val Cys Pro Pro Gly Gly He Gin Glu Val Thr Val Asn Gin Ser Leu 

130 135 140 

Leu Thr Pro Leu Asn Leu Gin He Asp Pro Thr He Gin Arg Val Arg 
145 150 155 ~ 160 

Ala Glu Glu Arg Glu Gin He Lys Thr Leu Asn Asn Lys Phe Ala Ser 

165 170 175 

Phe He Asp Lys Val Arg Phe Leu Glu Gin Gin Asn Lys Val Leu Glu 

180 185 190 

Thr Lys Trp Thr Leu Leu Gin Glu Gin Gly Thr Lys Thr Val Arg Gin 

195 200 205 

Asn Leu Glu Pro Leu Phe Glu Gin Tyr He Asn Asn Leu Arg Arg Gin 

210 215 220 

Leu Asp Ser He Val Gly Glu Arg Gly Arg Leu Asp Ser Glu Leu Arg 
225 230 235 240 

Gly Met Gin Asp Leu Val Glu Asp Phe Lys Asn Lys Tyr Glu Asp Glu 

245 250 255 

He Asn Lys Arg Thr Ala Ala Glu Asn Glu Phe Val Thr Leu Lys Lys 

260 265 270 

Asp Val Asp Ala Ala Tyr Met Asn Lys Val Glu Leu Gin Ala Lys Ala 

275 280 285 

Asp Thr Leu Thr Asp Glu He Asn Phe Leu Arg Ala Leu Tyr Asp Ala 

290 295 300 

Glu Leu Ser Gin Met Gin Thr His He Ser Asp Thr Ser Val Val Leu 
305 310 315 320 

Ser Met Asp Asn Asn Arg Asn Leu Asp Leu Asp Ser He He Ala Glu 

325 330 335 

Val Lys Ala Gin Tyr Glu Glu He Ala Gin Arg Ser Arg Ala Glu Ala 

340 345 350 

Glu Ser Trp Tyr Gin Thr Lys Tyr Glu Glu Leu Gin Val Thr Ala Gly 

355 360 365 

Arg His Gly Asp Asp Leu Arg Asn Thr Lys Gin Glu He Ala Glu He 

370 375 3 8 o 

Asn Arg Met He Gin Arg Leu Arg Ser Glu He Asp His Val Lys Lys 
385 390 395 400 

Gin Cys Ala Asn Leu Gin Ala Ala He Ala Asp Ala Glu Gin Arg Gly 

405 410 415 

Glu Met Ala Leu Lys Asp Ala Lys Asn Lys Leu Glu Gly Leu Glu Asp 
420 425 43 0 
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Ala 


Leu 


Gin 


Lys 


Ala Lys Gin Asp 






435 




440 


Gin 


Glu 


Leu 


Met 


Asn Val Lys Leu 




4 50 

*± Zj \J 






455 


Tvr 




Lys 


Leu 


Leu Glu Gly Glu 


465 

rx O 3 








470 


Val 


Glv 


Gin 


Val 


Asn lie Ser Val 








485 


Tyr 


Gly 


Gly Ala 


Ser Gly Val Gly 








500 




Ser 


Ser 


Tyr 


Ser 


Tyr Gly Ser Gly 






515 




520 


Ser 


Ser 


Ser Gly 


Arg Ala He Gly 




530 






535 


Gly 


Ser 


Ser 


Thr 


He Lys Tyr Thr 


545 








550 


Ser 


Tyr 


Lys 


His 





Leu Ala Arg Leu Leu Lys Glu Tyr 
445 

Ala Leu Asp Val Glu He Ala Thr 
460 

Glu Cys Arg Leu Asn Gly Glu Gly 
475 480 
Val Gin Ser Thr Val Ser Ser Gly 

490 495 
Ser Gly Leu Gly Leu Gly Gly Gly 
505 510 
Leu Gly Val Gly Gly Gly Phe Ser 
525 

Gly Gly Leu Ser Ser Val Gly Gly 
540 

Thr Thr Ser Ser Ser Ser Arg Lys 
555 560 



<210> 457 



<211> 590 



<212> PRT 



<213> Homo sapiens 



<400> 457 

Met Ser Arg Gin Ser Ser Val Ser Phe Arg Ser Gly Gly Ser Arg Ser 

1 5 10 15 

Phe Ser Thr Ala Ser Ala He Thr Pro Ser Val Ser Arg Thr Ser Phe 

20 25 30 

Thr Ser Val Ser Arg Ser Gly Gly Gly Gly Gly Gly Gly Phe Gly Arg 

35 40 45 

Val Ser Leu Ala Gly Ala Cys Gly Val Gly Gly Tyr Gly Ser Arg Ser 

50 55 60 

Leu Tyr Asn Leu Gly Gly Ser Lys Arg He Ser He Ser Thr Arg Gly 
65 " ~ 70 75 80 

Gly Ser Phe Arg Asn Arg Phe Gly Ala Gly Ala Gly Gly Gly Tyr Gly 

85 90 95 

Phe Gly Gly Gly Ala Gly Ser Gly Phe Gly Phe Gly Gly Gly Ala Gly 

100 105 110 

Gly Gly Phe Gly Leu Gly Gly Gly Ala Gly Phe Gly Gly Gly Phe Gly 

115 120 125 

Gly Pro Gly Phe Pro Val Cys Pro Pro Gly Gly He Gin Glu Val Thr 

130 135 140 

Val Asn Gin Ser Leu Leu Thr Pro Leu Asn Leu Gin He Asp Pro Ser 
145 150 155 160 

He Gin Arg Val Arg Thr Glu Glu Arg Glu Gin He Lys Thr Leu Asn 

165 170 175 

Asn Lys Phe Ala Ser Phe He Asp Lys Val Arg Phe Leu Glu Gin Gin 

180 185 190 

Asn Lys Val Leu Asp Thr Lys Trp Thr Leu Leu Gin Glu Gin Gly Thr 

195 " 200 205 

Lys Thr Val Arg Gin Asn Leu Glu Pro Leu Phe Glu Gin Tyr He Asn 

210 215 220 

Asn Leu Arg Arg Gin Leu Asp Ser He Val Gly Glu Arg Gly Arg Leu 
225 230 235 240 

Asp Ser Glu Leu Arg Asn Met Gin Asp Leu Val Glu Asp Phe Lys Asn 

245 250 255 

Lys Tyr Glu Asp Glu He Asn Lys Arg Thr Thr Ala Glu Asn Glu Phe 

260 265 270 

Val Met Leu Lys Lys Asp Val Asp Ala Ala Tyr Met Asn Lys Val Glu 
275 ~ 280 285 
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Leu Glu Ala Lys Val Asp Ala Leu Met Asp Glu He Asn Phe Met Lys 

290 295 300 

Met Phe Phe Asp Ala Glu Leu Ser Gin Met Gin Thr His Val Ser Asp 
305 310 315 320 

Thr Ser Val Val Leu Ser Met Asp Asn Asn Arg Asn Leu Asp Leu Asp 

325 330 335 

Ser He He Ala Glu Val Lys Ala Gin Tyr Glu Glu He Ala Asn Arg 

340 345 350 

Ser Arg Thr Glu Ala Glu Ser Trp Tyr Gin Thr Lys Tyr Glu Glu Leu 

355 360 365 

Gin Gin Thr Ala Gly Arg His Gly Asp Asp Leu Arg Asn Thr Lys His 

370 375 380 

Glu He Thr Glu Met Asn Arg Met He Gin Arg Leu Arg Ala Glu He 
385 390 395 400 

Asp Asn Val Lys Lys Gin Cys Ala Asn Leu Gin Asn Ala He Ala Asp 

405 410 415 

Ala Glu Gin Arg Gly Glu Leu Ala Leu Lys Asp Ala Arg Asn Lys Leu 

420 425 430 

Ala Glu Leu Glu Glu Ala Leu Gin Lys Ala Lys Gin Asp Met Ala Arg 

435 440 445 

Leu Leu Arg Glu Tyr Gin Glu Leu Met Asn Thr Lys Leu Ala Leu Asp 

450 455 460 

Val Glu He Ala Thr Tyr Arg Lys Leu Leu Glu Gly Glu Glu Cys Arg 
465 470 475 480 

Leu Ser Gly Glu Gly Val Gly Pro Val Asn He Ser Val Val Thr Ser 

485 490 495 

Ser Val Ser Ser Gly Tyr Gly Ser Gly Ser Gly Tyr Gly Gly Gly Leu 

500 505 510 

Gly Gly Gly Leu Gly Gly Gly Leu Gly Gly Gly Leu Ala Gly Gly Ser 

515 520 525 

Ser Gly Ser Tyr Tyr Ser Ser Ser Ser Gly Gly Val Gly Leu Gly Gly 

530 535 540 

Gly Leu Ser Val Gly Gly Ser Gly Phe Ser Ala Ser Ser Gly Arg Gly 
54 5 550 555 560 

Leu Gly Val Gly Phe Gly Ser Gly Gly Gly Ser Ser Ser Ser Val Lys 

565 570 575 

Phe Val Ser Thr Thr Ser Ser Ser Arg Lys Ser Phe Lys Ser 
580 585 590 

<210> 458 
<211> 523 
<212> PRT 
<213> Homo sapiens 



<400> 458 




Met 


Ser 


Arg Gin 


Phe Thr Cys Lys 


1 






5 


Phe 


Ser 


Gly Cys 


Ser Ala Val Leu 






20 




Arg 


Ala 


Gly Ser 


Lys Gly Leu Ser 






35 


40 


Tyr 


Ser 


Leu Gly 


Gly Val Arg Ser 




50 




55 


Lys 


Ser 


Gly Gly 


Tyr Gly Phe Gly 


65 






70 


Gly 


Ser 


Met Phe 


Gly Ser Val Ala 








85 


Cys 


Pro 


Pro Gly 


Gly He His Gin 






100 




Ala 


Pro 


Leu Asn 


Val Glu Leu Asp 






115 


120 



Ser Gly Ala Ala Ala Lys Gly Gly 

10 15 
Ser Gly Gly Ser Ser Ser Ser Phe 
25 30 
Gly Gly Phe Gly Ser Arg Ser Leu 
45 

Leu Asn Val Ala Ser Gly Ser Gly 
60 

Arg Gly Arg Ala Ser Gly Phe Ala 

75 80 
Leu Gly Pro Val Cys Pro Thr Val 

90 95 
Val Thr Val Asn Glu Ser Leu Leu 
105 110 

Pro Glu He Gin Lys Val Arg Ala 
125 
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Gln Glu Arg Glu Gin lie Lys Ala Leu Asn Asn Lys Phe Ala Ser Phe 

130 " 135 140 

lie Asp Lys Val Arg Phe Leu Glu Gin Gin Asn Gin Val Leu Glu Thr 
145 ** 150 155 160 

Lys Trp Glu Leu Leu Gin Gin Leu Asp Leu Asn Asn Cys Lys Asn Asn 

165 170 175 

Leu Glu Pro He Leu Glu Gly Tyr He Ser Asn Leu Arg Lys Gin Leu 

180 185 190 

Glu Thr Leu Ser Gly Asp Arg Val Arg Leu Asp Ser Glu Leu Arg Asn 

195 " 200 205 

Val Arg Asp Val Val Glu Asp Tyr Lys Lys Arg Tyr Glu Glu Glu He 

210 215 220 

Asn Lys Arg Thr Ala Ala Glu Asn Glu Phe Val Leu Leu Lys Lys Asp 
225 ~ 230 235 240 

Val Asp Ala Ala Tyr Ala Asn Lys Val Glu Leu Gin Ala Lys Val Glu 

245 250 255 

Ser Met Asp Gin Glu He Lys Phe Phe Arg Cys Leu Phe Glu Ala Glu 

260 265 270 

He Thr Gin He Gin Ser His He Ser Asp Met Ser Val He Leu Ser 

275 280 285 

Met Asp Asn Asn Arg Asn Leu Asp Leu Asp Ser He He Asp Glu Val 

290 295 300 

Arg Thr Gin Tyr Glu Glu He Ala Leu Lys Ser Lys Ala Glu Ala Glu 
305 310 315 320 

Ala Leu Tyr Gin Thr Lys Phe Gin Glu Leu Gin Leu Ala Ala Gly Arg 

325 330 335 

His Gly Asp Asp Leu Lys Asn Thr Lys Asn Glu lie Ser Glu Leu Thr 

340 345 350 

Arg Leu He Gin Arg He Arg Ser Glu He Glu Asn Val Lys Lys Gin 

355 360 365 

Ala Ser Asn Leu Glu Thr Ala He Ala Asp Ala Glu Gin Arg Gly Asp 

370 375 380 

Asn Ala Leu Lys Asp Ala Arg Ala Lys Leu Asp Glu Leu Glu Gly Ala 
385 390 395 400 

Leu His Gin Ala Lys Glu Glu Leu Ala Arg Met Leu Arg Glu Tyr Gin 

405 410 415 

Glu Leu Met Ser Leu Lys Leu Ala Leu Asp Met Glu He Ala Thr Tyr 

420 425 430 

Arg Lys Leu Leu Glu Ser Glu Glu Cys Arg Met Ser Gly Glu Phe Pro 

435 440 445 

Ser Pro Val Ser He Ser He He Ser Ser Thr Ser Gly Gly Ser Val 

450 455 460 

Tyr Gly Phe Arg Pro Ser Met Val Ser Gly Gly Tyr Val Ala Asn Ser 
465 470 475 480 

Ser Asn Cys He Ser Gly Val Cys Ser Val Arg Gly Gly Glu Gly Arg 

485 490 495 

Ser Arg Gly Ser Ala Asn Asp Tyr Lys Asp Thr Leu Gly Lys Gly Ser 

500 505 510 

Ser Leu Ser Ala Pro Ser Lys Lys Thr Ser Arg 

515 520 
<210> 459 

<211> 529 

<212> PRT 

<213> Homo sapiens 



<400> 459 

Met Ser Arg Gin Leu Asn He Lys Ser Ser Gly Asp Lys Gly Asn Phe 
1 ~ 5 10 15 

Ser Val His Ser Ala Val Val Pro Arg Lys Ala Val Gly Ser Leu Ala 
20 25 30 
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Ser Tyr Cys Ala Ala Gly Arg Gly Ala Gly Ala Gly Phe Gly Ser Arg 

35 40 45 

Ser Leu Tyr Ser Leu Gly Gly Asn Arg Arg He Ser Phe Asn Val Ala 

50 55 60 

Gly Gly Gly Val Arg Ala Gly Gly Tyr Gly Phe Arg Pro Gly Ser Gly 
65 70 75 ~ " 80 

Tyr Gly Gly Gly Arg Ala Ser Gly Phe Ala Gly Ser Met Phe Gly Ser 

85 90 95 

Val Ala Leu Gly Pro Ala Cys Leu Ser Val Cys Pro Pro Gly Gly He 

100 105 no 

His Gin Val Thr Val Asn Lys Ser Leu Leu Ala Pro Leu Asn Val Glu 

115 120 125 

Leu Asp Pro Glu He Gin Lys Val Arg Ala Gin Glu Arg Glu Gin He 

130 135 140 

Lys Val Leu Asn Asp Lys Phe Ala Ser Phe He Asp Lys Val Arg Phe 
145 150 155 " 160 

Leu Glu Gin Gin Asn Gin Val Leu Glu Thr Lys Trp Glu Leu Leu Gin 

165 170 175 

Gin Leu Asp Leu Asn Asn Cys Lys Lys Asn Leu Glu Pro He Leu Glu 

180 185 190 

Gly Tyr He Ser Asn Leu Arg Lys Gin Leu Glu Thr Leu Ser Gly Asp 

195 200 205 

Arg Val Arg Leu Asp Ser Glu Leu Arg Ser Met Arg Asp Leu Val Glu 

210 215 220 

Asp Tyr Lys Lys Arg Tyr Glu Val Glu He Asn Arg Arg Thr Thr Ala 
225 230 235 " 240 

Glu Asn Glu Phe Val Val Leu Lys Lys Asp Ala Asp Ala Ala Tyr Ala 

245 250 ~ 255 

Val Lys Val Glu Leu Gin Ala Lys Val Asp Ser Leu Asp Lys Asp He 

260 265 270 

Lys Phe Leu Lys Cys Leu Tyr Asp Ala Glu He Ala Gin He Gin Thr 

275 280 285 

His Ala Ser Glu Thr Ser Val He Leu Ser Met Asp Asn Asn Arg Asp 

290 295 300 

Leu Asp Leu Asp Ser He He Ala Glu Val Arg Met His Tyr Glu Glu 
305 310 315 - 320 

He Ala Leu Lys Ser Lys Ala Glu Ala Glu Ala Leu Tyr Gin Thr Lys 

325 330 335 

He Gin Glu Leu Gin Leu Ala Ala Ser Arg His Gly Asp Asp Leu Lys 

340 345 350 

His Thr Arg Ser Glu Met Val Glu Leu Asn Arg Leu He Gin Arg He 

355 360 ' 365 

Arg Cys Glu He Gly Asn Val Lys Lys Gin Arg Ala Ser Leu Glu Thr 

370 375 380 

Ala He Ala Asp Ala Glu Gin Arg Gly Asp Asn Ala Leu Lys Asp Ala 
385 390 395 400 

Gin Ala Lys Leu Asp Glu Leu Glu Gly Ala Leu His Gin Ala Lys Glu 

405 410 415 

Glu Leu Ala Arg Met Leu Arg Glu Tyr Gin Glu Leu Met Ser Leu Lys 

420 425 430 

Leu Ala Leu Asp Met Glu He Ala Thr Tyr Arg Lys Leu Leu Glu Gly 

435 440 445 

Glu Glu Cys Arg Met Ser Gly Glu Asn Pro Ser Ser Val Ser He Ser 

450 455 460 

Val lie Ser Ser Ser Ser Tyr Ser Tyr His His Pro Ser Ser Ala Gly 
465 470 475 480 

Val Asp Leu Gly Ala Ser Ala Val Ala Gly Ser Ser Gly Ser Thr Gin 

485 490 495 

Ser Gly Gin Thr Lys Thr Thr Glu Ala Arg Gly Gly Asp Leu Lys Asp 

500 505 * 510 

Thr Gin Gly Lys Ser Thr Pro Ala Ser He Pro Ala Arg Lys Ala Thr 
515 520 525 

Arg 



<210> 460 
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<211> 511 
<212> PRT 
<213> Homo sapiens 



<400> 460 
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Arg 
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Leu 
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I1XD XT JUL C 
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20 








G q "D"V-» £i 

Oci JtrXlc 


Arg 
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35 










Lieu, ser 
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Ser Arg 
















Gly Gly 


Gly Arg 


ncu uiy 


65 








70 




Cat" 7\ 1 a 
oci nJla 


Gly Leu 


Gly Pro 


T «v/Ci Pt/q 
xjyes uya 








85 






11c JtTO 


Gin 


Val 


Thr 


Val 


Asa LyS 






100 








r*1 it Mot- 


Asp 


Pro 


Glu 


lie 


v3 m Arg 




115 








XrniV 


He Lys 


Ala 


Leu 


Asn 


Asn 


i-jy 0 Jrixe 


loU 










13 5 - 


lr lie Leu 


Glu 


Gin 


Gin 


Asn 


c*m val 


TAR 
14 3 








150 






Leu 


Asp 


Leu 


Asn 


ABn LyB 








165 






Pi ... Ol mm 


Tyr 


± J.e 


Ser 


Asn 


Leu Gin 






180 








/iop \Jly 


vai 


Arg 


Leu 


Asp 


oer i?iu 




1 QtZ 








A A 


ulU Asp 


A Y r 


iiys 


Lys 


Arg 


•TV r-v ^1 T « • 

iyr vixu 


51 O 

JL \J 












Ala ulU 


Asn 


Glu 


Phe 


Val 


vaj. lieu 










230 




Mat* 7\ «n 
HcL ASH 


Lys 


Val 


Glu 


Leu 


(jrin Aid 








245 






XX6 Lys 


Phe 


Phe 


Lys 


Cys 


iieu iyi 






260 








Ser- Hiq 

0Ci uJ.9 


lie 


Ser 


Asp 


Thr 


c pr Tie 

OCX xxc 




275 












Asp 


Leu 


Asp 


Ser 


He He 


290 










OQC 

m* J ZJ 


Glu lie 


Ala 


Leu 


Lys 


Ser 


Lys Ala 


305 








310 




Lys lie 


Gin 


Glu 


Leu 


Gin 


Val Thr 








325 






Lys Leu 


Thr 


Lys 


Ala 


Glu 


He Ser 






340 








lie Arg 


Ser 


Glu 


He 


Gly 


Asn Val 




355 








360 


Thr Ala 


He 


Ala 


Asp 


Ala 


Glu Gin 


370 










375 


Ala Arg 


Ala 


Lys 


Leu Asp 


Glu Leu 


385 








390 




Glu Glu 


Leu 


Ala 


Arg 


Met 


Leu Arg 








405 






Lys Leu 


Ala 


Leu 


Asp 


Met 


Glu He 






420 








Ser Glu 


Glu 


Cys 


Arg 


Met 


Ser Gly 




435 








440 
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Pro 




Glv 


Glu 




Leu 


Glv 


Phe 




10 










15 




oiy 


Gly 


He 


Glv 


Ser 


Ser 


Ser 


Ala 


25 










30 






Cav 
OCi 




Cor 

OCX. 


Phe 


Gl v 
uiy 


C A Y* 
OCi. 


uy 0 


Cpv 
OCl 










4.5 








Cay* 

Del 


Leu 


AXa 




Coy" 
OCl 


Ala 


Ala 


Ala 


















v3iy 


pV| P 


vdl 


Gl v 


TYi y* 
1X1X7 


-rV J- a 


pup 

XT IXC 


Gl \r 
uiy 
















AO 


pro 


Q Ay* 
OC1 


V ell 


cys 




XrXVO 


uiy 


«jiy 




90 










95 




OCX 


T.pii 


UC U 


Ala 




UC U 


nan 
x\s> XX 


v ai 


-L W _J 










HO 






Vdl 


Arg 


Ala 


v?in 


blu 


Arg 


pi 


vain 










l£J 








Aia 


Car- 

oer 


T»io 

irne 


ne 


Asp 


Lys 


vai 


Arg 








1ft u 










Leu 


ulU 


T*Vi y- 

inr 


Lys 


Trp 


Asn 


Leu 


Leu 






1DD 










lOU 


Arg 


Lys 


Asn 


Leu 


fin 

bill 


Pro 


Tl 0 

lie 


Tyr 




1 / u 










1 / D 




Lys 


m n 

ulQ 


Leu 


ulU 


Mot- 


Leu 


Ser 


pii . , 
vjiy 


185 










190 






Leu 


Arg 


Asn 


Mph 


Vain 


Asp 


Leu 


Val 










905 

a V -J 








Veil 


Gl n 


Tip 

lie 




Arg 


Arg 


inr 


Ala 








220 










Lys 


Lys 


Asp 


Val 


Asp 


Aia 


Ala 


Tyr 






235 










240 


Lys 


Val 


Asp 


oer 


Leu 


inr 


' ASp 


bill 




250 










955 
^ -j j 




Glu 


Glv 


Glu 


He 
j. j. c 




Gin 

VJ1U 


Tie 


Gin 


265 










270 






Val 


Leu 


Ser 


Met 


Asp 


Asn 


Asn 












285 








Ala 


Glu 


Val 


Arcr 


Ala 


Gin 


Tvr 
j 


Glu 








300 










Glu 


Ala 


Glu 


Thr 


Leu 


Tyr 


Gin 


Thr 






315 










320 


Ala 


Gly 


Gin 


His 


Gly 


Asp 


Asp 


Leu 




330 










335 




Glu 


Leu 


Asn 


Arg 


Leu 


He 


Gin 


Arg 


345 










350 






Lys 


Lys 


Gin 


Cys 


Ala 


Asp 


Leu 


Glu 










365 








Arg 


Gly 


Asp 


Cys 


Ala 


Leu 


Lys 


Asp 








380 










Glu 


Gly 


Ala 


Leu 


His 


Gin 


Ala 
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Pro Arg Ala Phe Ser Ser Arg Ser Tyr Thr Ser Gly Pro Gly Ser Arg 

20 25 30 

lie Ser Ser Ser Ser Phe Ser Arg Val Gly Ser Ser Asn Phe Arg Gly 

35 40 45 

Gly Leu Gly Gly Gly Tyr Gly Gly Ala Ser Gly Met Gly Gly lie Thr 

50 * 55 60 

Ala Val Thr Val Asn Gin Ser Leu Leu Ser Pro Leu Val Leu Glu Val 
65 70 75 80 

Asp Pro Asn lie Gin Ala Val Arg Thr Gin Glu Lys Glu Gin He Lys 

85 90 95 

Thr Leu Asn Asn Lys Phe Ala Ser Phe He Asp Lys Val Arg Phe Leu 

100 105 HO 

Glu Gin Gin Asn Lys Met Leu Glu Thr Lys Trp Ser Leu Leu Gin Gin 

115 120 125 

Gin Lys Thr Ala Arg Ser Asn Met Asp Asn Met Phe Glu Ser Tyr He 

130 135 140 

Asn Asn Leu Arg Arg Gin Leu Glu Thr Leu Gly Gin Glu Lys Leu Lys 
145 150 155 160 

Leu Glu Ala Glu Leu Gly Asn Met Gin Gly Leu Val Glu Asp Phe Lys 

165 170 175 

Asn Lys Tyr Glu Asp Glu lie Asn Lys Arg Thr Glu Met Glu Asn Glu 

180 185 190 

Phe Val Leu He Lys Lys Asp Val Asp Glu Ala Tyr Met Asn Lys Val 

195 200 205 

Glu Leu Glu Ser Arg Leu Glu Gly Leu Thr Asp Glu He Asn Phe Leu 

210 215 220 

Arg Gin Leu Tyr Glu Glu Glu He Arg Glu Leu Gin Ser Gin lie Ser 
225 230 235 240 

Asp Thr Ser Val Val Leu Ser Met Asp Asn Ser Arg Ser Leu Asp Met 

245 250 255 

Asp Ser He He Ala Glu Val Lys Ala Gin Tyr Glu Asp He Ala Asn 
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Arg Ser Arg Ala Glu Ala Glu Ser Met Tyr Gin He Lys Tyr Glu Glu 
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Leu Gin Ser Leu Ala Gly Lys His Gly Asp Asp Leu Arg Arg Thr Lys 
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Thr Glu He Ser Glu Met Asn Arg Asn He Ser Arg Leu Gin Ala Glu 
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He Glu Gly Leu Lys Gly Gin Arg Ala Ser Leu Glu Ala Ala He Ala 
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Asp Ala Glu Gin Arg Gly Glu Leu Ala He Lys Asp Ala Asn Ala Lys 
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Leu Ser Glu Leu Glu Ala Ala Leu Gin Arg Ala Lys Gin Asp Met Ala 
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Arg Gin Leu Arg Glu Tyr Gin Glu Leu Met Asn Val Lys Leu Ala Leu 
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Asp He Glu He Ala Thr Tyr Arg Lys Leu Leu Glu Gly Glu Glu Ser 
385 390 395 400 

Arg Leu Glu Ser Gly Met Gin Asn Met Ser He His Thr Lys Thr Thr 

405 410 415 

Ser Gly Tyr Ala Gly Gly Leu Ser Ser Ala Tyr Gly Gly Leu Thr Ser 
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Pro Gly Leu Ser Tyr Ser Leu Gly Ser Ser Phe Gly Ser Gly Ala Gly 

435 440 445 

Ser Ser Ser Phe Ser Arg Thr Ser Ser Ser Arg Ala Val Val Val Lys 
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Lys He Glu Thr Arg Asp Gly Lys Leu Val Ser Glu Ser Ser Asp Val 
465 470 475 480 

Leu Pro Lys 

<210> 467 

<211> 430 



<212> PRT 



BHC03 1 051 



<213> Homo sapiens 



<400> 467 



Met 


Ser 


Phe 


Thr 


Thr 


Arg 


Ser 


Thr 


1 








5 








Gly 


Ser 


Val 


Gin 


Ala 


Pro 


Ser 


TVr 
xyx 








20 








Ala 


Ser 


Val 


Tvr 


Ala 


Glv 


Ala 


Glv 
varxy 






35 










40 


Ser 


Arci 


Ser 


Thr 


Ser 


Phe 




Glv 




50 










55 




Thr 


Glv 


lie 


Ala 


Glv 

vjxy 


Glv 


Leu 


Ala 


65 










70 






Iiys 


Glu 


Thr 


Met 


Gin 


Ser 


XJC u 


nan 










85 








Arcr 


Val 


Ara 
m y 


Ser 


Leu 


Glu 


Thr* 

X XIX 


Glu 








100 










Ai-cr 


Glu 


His 


uc u 


gi ii 

VJi IX 


T ."V^Q 
XJJf £> 


T 

Lys 


Gl vr 






115 










120 


Tvr 

A y * 


Phe 


Lys 


Tie 

x 


lie 

XXC 


Gl ii 

V7X IX 


7\ on 


UCU 




130 










135 




Val 


Asp 


Asn 


Ala 




T1 


Val 
Val 


T 

Lieu 


145 










150 






Ala 




TV on 




Arg 


Vex J. 


T .vo 
Xiy S 


Tyr 










X O J 








Ser 


V d X 


Glu 


7\ on 
noil 


nap 


Tl f= 
lie 


XIX s 


Gl \r 








IRQ 
J. o u 










Asn 


lie 


Thr 




T ,pii 
XJC u 


Gl n 

V3X.11 


Xieu 


Gl ii 

VjrXU 






195 












Glu 


Leu 


Leu 


Phe 


1*1 G i. 


_ 

Lys 


_ 

Lys 


Asn 




210 










Z ID 




Gin 


Ala 


Gin 


Tie 

X XC 


Al a 


Car 
OCX 


OCX 


Gl \r 


225 










<o J V 




xjy o 


OCX 


Gl n 


A CT^ 

nap 




Al a 

Hid. 


Lys 


Tl <» 
lie 










245 








Asp 


Glu 


Leu 


Ala 




XI jf £> 


A an 
noli 


Arg 








260 










Gin 


Gin 


lie 


Glu 


Glu 


OCX 


X11X 


TVi-r 

1 11 X 






275 










280 


Val 


Glv 


Ala 


Ala 


Glu 


Thr 


Thr 


XJCU 




290 










295 




Ser 


Leu 


Glu 


lie 


Asp 


Leu 


Asp 


Ser 


305 










310 






Glu 


Asn 


Ser 


Leu 


Arg 


Glu 


Val 


Glu 










325 








Gin 


Leu 


Asn 


Gly 


lie 


Leu 


Leu 


His 








340 










Arg 


Ala 


Glu 


Gly 


Gin 


Arg 


Gin 


Ala 






355 










360 


lie 


Lys 


Val 


Lys 


Leu 


Glu 


Ala 


Glu 




370 










375 




Glu 


Asp 


Gly 


Glu 


Asp 


Phe 


Asn 


Leu 


385 










390 






Ser 


Met 


Gin 


Thr 


lie 


Gin 


Lys 


Thr 










405 








Lys 


Val 


Val 


Ser 


Glu 


Thr 


Asn 


Asp 



420 
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Phe 


Ser 


Thr 


Asn 


Tvr 

xy x 




Ser 


XJC? u 




10 










15 




Glv 
v»xy 


Ala 


At^t 

Axrg 


C X \J 


Val 

val 


Ser 


Q -y- 

OC1 


Al a 
nia 


25 










30 






Glv 
•oxy 


Ser 


Glv 
vaxy 


Ser 


At*ct 

Axrg 


He 


OCX 


Val 
val 










45 ■ 








Glv 


Met: 


Gl v 
vjxy 


OCX 

60 


Gl v 


Gl v 


T.oii 


al a 

Hid 


Glv 
v»xy 


Met: 


Gl v 


Gl v 


Tl ^ 
X X c 


Gl n 

ulll 


Asn 


Gl ii 

UlU 






75 










a \J 


nop 


Arg 


XJfc? U 


al a 

Ala 


oer 


Tyr 


Leu 


Asp 




gn 












Aon 
noil 


Arg 




Leu 


Gl ii 
OlU 


oer 


Lys 


j.ie 


105 










11 u 






Pro 


VjXH 


Val 

vax 


Arg 


Asp 


Trp 


Ser 


HI S 


















Airg 


Al a 
Mia 


Gl n 

oxn 


Tl <=» 
lie 


■flits 


al a 

Aid 


Asn 


mr 








X*± \J 










Gl Tl 
UlU 


Tl <» 
lie 


nop 


Asn 


2Vl za 
Aid 


Arg 


Leu 


Ax a 






155 










i fin 

X o u 


Gl ii 


IXlx 


/5i ii 


Leu 


TV 1 — 

Ax a 


nee 


Arg 


Gin 




X / U 










X JO 




Leu 


Arg 


Lys 


Via 1 
Veil 


lie 


Asp 


Asp 


Tnr 


1 ft^ 










1 OH 






1 llx 


Gl ii 


Tl <=» 


m ii 

vjJLU 


Aid 


Leu 


Lys 


GlU 










one 








XlX S 


rsi ii 

oxU 


^ii n 


f2T ii 


vaj. 


Lys 


G±y 


Leu 








o*5 n 
z z, u 










Leu 


xnr 


vax 


bill 


vax 


ASp 


A±a 


Pro 






*3 1 C 










240 


wee 


Ala 


TV nn 

ASp 


ixe 


Arg 


TV T _ 

Ala 


Gin 


Tyr 














*3 C C 




Gl ii 
W1U 


Gl ii 
VjX u 


Leu 


ASp 


Lys 


rru 

Tyr 


Trp 


ser 


265 










§ \J 






Val 

vax 


Val 
Val 


1 XlX 


xilx 


f2l n 


oer 


Aid. 


r^l ii 










Ct o 








X 11X 


Gl ii 
vtXU 


T.A11 


Arg 


Arg 


1 ill 


\7a 1 

vai 


r>l vi 

bin 








J V V 












Arg 


Asn 


Leu 


Lys 


al a 

ivlol 


Cor 

oer 


Leu 






315 










320 


Ala 


Arg 


Tyr 


Ala 


Leu 


Gin 


Met 


Glu 




330 










335 




Leu 


Glu 


Ser 


Glu 


Leu 


Ala 


Gin 


Thr 


345 










350 






Gin 


Glu 


Tyr 


Glu 


Ala 


Leu 


Leu 


Asn 










365 








He 


Ala 


Thr 


Tyr 


Arg 


Arg 


Leu 


Leu 








380 










Gly 


Asp 


Ala 


Leu 


Asp 


Ser 


Ser 


Asn 






395 










400 


Thr 


Thr 


Arg 


Arg 


He 


Val 


Asp 


Gly 




410 










415 




Thr 


Lys 


Val 


Leu 


Arg 


His 







425 430 
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<400> 468 








Met 


Val 


Ala Arg 


Val 


Gly Leu Leu 


1 






5 






Trp 


Gly 


His Leu 


Asp Ala Gin Pro 






20 










Lys 


Glu Ala 


ulU 


Ala 


rxie lieu 












A n 


Gin 


Val 


riv uyo 


Ala 


Fro 


Thr Ser 












c c 


Ala 


Phe 


VJXll X X JkJ 


Val 


Ser 


Gin Leu 










70 




Ala 


x xix 


xicu nx y 


/"IT. «r% 

Gin 


Met 


Thr Arg 








85 






Asn 


Ser 


TVt* Ala 
x y x rua 


Ala 


Trp 


Ala Glu 






i on 








Ilis 


Arg 


inr juy s 


Met 


Arg 


Arg Lys 






11j 






12 0 


T .\ro. 
xiy d 


xirp 


r P\nr T.v/'Q 
iy i uy o 


Gin 


His 


Leu Ser 












135 


£115 


Leu 


riU vjrlu 


Pro 


Ala 


Val Arg 










150 


T.on 
ucu 


lrp 


OCX M.OX1 


Val 


Ser 


Ala Leu 








165 






vxly 


Pro 


a TV » 
Aid /\Sp 


He 


Arg 


Leu Thr 






1 O A 

loU 










Leu 


vjxy Asn 


Ala 


Phe Asp Gly 












200 




Leu 


rxo /iiy 


Arg Gly Glu Ala 




OTA 

z 1U 








215 


Cpy- 


Leu 


Cor 

ocx fug 


Arg Arg Gly Arg 


OOC 








230 




wX U 


JLXC 


vjxy nio 


Thr 


Leu 


Gly Leu 








245 






T.on 

•UC u 


Mot* 


Ala D 

MX d. Jtrxo 


Tyr 


Tyr 


Lys Arg 






O £ A 










Asp 


ASp Vol 


Leu 


Ala 


Val Gin 






A / D 






280 


vjxy 


OCX 


VdX nla 


Val 


Gin 


Leu Pro 




o on 








295 


Thr 


Trp 


Asp Ser 


Tyr 


Ser 


Pro Gin 


305 








310 




Pro 


Lys 


Tyr Cys 


His 


Ser 


Ser Phe 








325 






Gin 


Gin 


Leu Tyr 


He 


Phe 


Lys Gly 






340 








Asp 


Gly 


Asn Val 


Ser 


Glu 


Pro Arg 






355 






360 


Leu 


Pro 


Pro Asn 


He 


Glu 


Ala Ala 




370 








375 


Phe 


Tyr 


Phe Phe 


Lys 


Val 


Gin Ser 


385 








390 





<210> 469 



<211> 851 
<212> PRT 



Leu Ato Ala 


Leu 


Gin 


j-ic \JL 




ucu 


10 

X \J 








ID 




Ala Glu Arg 


Gly 


Gly 


Gin 


Glu 


Leu 








A 






Glu Lys Tyr 


Gly 


Tyr 


Leu 


Asn 


Glu 


fTiT" Him DVip 
xxix nxy Jcr lie 


ocx 

C A 


Asp 


nla 


TT o 

lie 


Arg 


*Dy~t~\ Oar 
rx W Val ocx 


vjj.y 


Val 


Leu 


Asp 


Arg 


/5 










80 


irxvj Hiy v»Y© 


m v 

uiy 


Veil 


X 111 


Asp 


1 XXX 










- f\ t 
95 




Arg He Ser 


Asp 


Leu 


Phe 


Ala 


Arg 


1 A C 

JLU5 






110 






Lys Arg Phe 


Ala 


Lys 
125 


Gin 


Gly 


Asn 


Tyr Arg Leu 


val 
140 


Asn 


Trp 


Pro 


Glu 


pi TV I — TT 9 1 

uxy ax a vai 


Arg 


Ala 


TV T <m 

Ala 


Pne 


Gin 


ICC 










160 


Glu Phe Trp 


G1U 


ill a 

Ala 


Pro 


TV 1 -» 

Ala 


Thr 


170 








175 




Phe Phe Gin 


Gly 


Asp 


His 


Asn 


Asp 


185 






190 






Pro Gly Gly 


Ala 


Leu 
205 


Ala 


His 


Ala 


ill S rile ASp 


vain 
220 


Asp 


GlU 


TV -u-j-m 

Arg 


Trp 


Asn Leu fxie 


vai 


vai 


Leu 


Ala 


HIS 


235 










240 


Thr His Ser 


Pro 


Ala 


Pro 


Arg 


-7. T 

Ala 


250 








255 




Leu Gly Arg 


Asp 


Ala 


Leu 


Leu 


Ser 








270 






Ser Leu Tyr 


Glv 

\JXJT 


Lys 
285 


Pro 


Leu 


Gly 


Gly Lys Leu 


Phe 
300 


Thr 


Asp 


Phe 


Glu 


Gly Arg Arg 


Pro 


Glu 


Thr 


Gin 


Gly 


315 










320 


Asp Ala He 


Thr 


Val 


Asp 


Arg 


Gin 


330 








335 




Ser His Phe 


Trp 


Glu 


Val 


Ala 


Ala 


345 






350 






Pro Leu Gin 


Glu 


Arg 
365 


Trp 


Val 


Gly 


Ala Val ser 


Leu 


Asn 


Asp 


Gly 


Asp 



380 
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<400> 469 
Met Ala Gin Trp Glu 
1 5 
Gin Leu His Gin Leu 
20 

Gin Tyr Leu Ala Val 
35 

Leu Gly Ser Asp Asp 
50 

Asp Gin Leu Asn Tyr 
65 

Leu Leu Leu Gin His 
85 

Phe Ser Gin Asp Pro 
100 

Leu Glu Glu Lys Arg 
115 

Gin Gly Glu Pro Val 
130 

He Glu Ser Arg lie 
145 

Lys Ser He Ser Gin 
165 

Tyr Lys lie Gin Ala 
180 

Thr Lys Glu Gin Lys 
195 

Arg Arg Lys Glu Val 
210 

Thr Thr Leu He Glu 
225 

Gin Gin Gin Lys Ala 
245 

Gin Leu Glu Thr Trp 
260 

Arg Gin Leu Leu Lys 
275 

Gin Asp Asp Pro Leu 
290 

Thr Glu Leu Leu Gin 
305 

Gin Pro Cys Met Pro 
325 

Gly Ser Lys Phe Thr 
340 

Gly Asn Glu Ser Leu 
355 

Gin Leu Gin Gly Phe 
370 

Thr Leu Thr Pro Glu 
385 

Gly Tyr Leu Thr Leu 
405 

Ser Asn Lys Gly Pro 
420 

Phe Thr Val Lys Tyr 
435 

Asp Thr Leu Pro Val 
450 

Ala Trp Ala Ser Val 
465 

Asn Gin Gin Phe Phe 
485 

Gly Pro Ala Leu Ser 
500 



Met 


Leu 


Gin Asn Leu Asp 






10 


Tyr 


Ser 


His Ser Leu Leu 






25 


Trp 


He 


Glu Asp Gin Asn 






40 


Ser 


Lys 


Ala Thr Met Leu 




55 




Glu 


Cys 


Gly Arg Cys Ser 


70 




75 


Asn 


Leu 


Arg Lys Phe Cys 






90 


Thr 


Gin 


Leu Ala Glu Met 






105 


He 


Leu 


He Gin Ala Gin 






120 


Leu 


Glu 


Thr Pro Val Glu 




135 




Leu 


Asp 


Leu Arg Ala Met 


150 




155 


Leu 


Lys 


Asp Gin Gin Asp 






170 


Lys 


Gly 


Lys Thr Pro Ser 






185 


He 


Leu 


Gin Glu Thr Leu 






200 


Leu 


Asp 


Ala Ser Lys Ala 




215 




Leu 


Leu 


Leu Pro Lys Leu 


230 




235 


Cys 


He 


Arg Ala Pro He 






250 


Phe 


Thr 


Ala Gly Ala Lys 






265 


Glu 


Leu 


Lvs Glv Leu Ser 






280 


Thr 


Lvs 

w jr o 


Glv Val Asn Leu 




295 




Arq 


Leu 


Leu His Arcr Ala. 


310 




315 


Gin 


Thr 


Pro His Arg Pro 






330 


Val 


Arg 


Thr Arg Leu Leu 






345 


Thr 


Val 


Glu Val Ser He 






360 


Arq 


Lvs 


Phe Asn He Leu 




375 




Lys 


Gly 


Gin Ser Gin Gly 


390 




395 


Val 


Glu 


Gin Arg Ser Gly 






410 


Leu 


Gly 


Val Thr Glu Glu 






425 


Thr 


Tyr 


Gin Gly Leu Lys 






440 


Val 


He 


He Ser Asn Met 




455 




Leu 


Trp 


Phe Asn Leu Leu 


470 




475 


Ser 


Asn 


Pro Pro Lys Ala 






490 


Trp 


Gin 


Phe Ser ser Tyr 



505 



Ser Pro Phe Gin Asp 
15 

Pro Val Asp He Arg 
30 

Trp Gin Glu Ala Ala 
45 

Phe Phe His Phe Leu 
60 

Gin Asp Pro Glu Ser 
80 

Arg Asp He Gin Pro 
95 

He Phe Asn Leu Leu 
110 

Arg Ala Gin Leu Glu 
125 

Ser Gin Gin His Glu 
140 

Met Glu Lys Leu Val 
160 

Val Phe Cys Phe Arg 
175 

Leu Asp Pro His Gin 
190 

Asn Glu Leu Asp Lys 
2 05 

Leu Leu Gly Arg Leu 
220 

Glu Glu Trp Lys Ala 
240 

Asp His Gly Leu Glu 
255 

Leu Leu Phe His Leu 
270 

Cys Leu Val Ser Tyr 
285 

Arg Asn Ala Gin Val 
300 

Phe Val Val Glu Thr 
320 

Leu He Leu Lys Thr 
335 

Val Arg Leu Gin Glu 
350 

Asp Arg Asn Pro Pro 
365 

Thr Ser Asn Gin Lys 
380 

Leu He Trp Asp Phe 
400 

Gly Ser Gly Lys Gly 
415 

Leu His He He Ser 
430 

Gin Glu Leu Lys Thr 
445 

Asn Gin Leu Ser He 
460 

Ser Pro Asn Leu Gin 
480 

Pro Trp Ser Leu Leu 
495 

Val Gly Arg Gly Leu 
510 



BHC03 1 051 



-278- 

Asn Ser Asp Gin Leu Ser Met Leu Arg Asn Lys Leu Phe Gly Gin Asn 

515 520 525 

Cys Arg Thr Glu Asp Pro Leu Leu Ser Trp Ala Asp Phe Thr Lys Arg 

530 535 540 

Glu Ser Pro Pro Gly Lys Leu Pro Phe Trp Thr Trp Leu Asp Lys lie 
545 550 555 560 

Leu Glu Leu Val His Asp His Leu Lys Asp Leu Trp Asn Asp Gly Arg 

565 570 575 

He Met Gly Phe Val Ser Arg Ser Gin Glu Arg Arg Leu Leu Lys Lys 

580 585 590 

Thr Met Ser Gly Thr Phe Leu Leu Arg Phe Ser Glu Ser Ser Glu Gly 

595 600 605 

Gly He Thr Cys Ser Trp Val Glu His Gin Asp Asp Asp Lys Val Leu 

610 615 620 

He Tyr Ser Val Gin Pro Tyr Thr Lys Glu Val Leu Gin Ser Leu Pro 
625 630 635 640 

Leu Thr Glu He He Arg His Tyr Gin Leu Leu Thr Glu Glu Asn He 

645 ^ 650 655 

Pro Glu Asn Pro Leu Arg Phe Leu Tyr Pro Arg He Pro Arg Asp Glu 

660 665 670 

Ala Phe Gly Cys Tyr Tyr Gin Glu Lys Val Asn Leu Gin Glu Arg Arg 

675 680 685 

Lys Tyr Leu Lys His Arg Leu He Val Val Ser Asn Arg Gin Val Asp 

690 695 700 

Glu Leu Gin Gin Pro Leu Glu Leu Lys Pro Glu Pro Glu Leu Glu Ser 
705 710 715 720 

Leu Glu Leu Glu Leu Gly Leu Val - Pro Glu Pro Glu Leu Ser Leu Asp 

725 " 730 735 

Leu Glu Pro Leu Leu Lys Ala Gly Leu Asp Leu Gly Pro Glu Leu Glu 

740 745 750 

Ser Val Leu Glu Ser Thr Leu Glu Pro Val He Glu Pro Thr Leu Cys 

755 760 765 

Met Val Ser Gin Thr Val Pro Glu Pro Asp Gin Gly Pro Val Ser Gin 

770 775 780 

Pro Val Pro Glu Pro Asp Leu Pro Cys Asp Leu Arg His Leu Asn Thr 
785 790 795 800 

Glu Pro Met Glu He Phe Arg Asn Cys Val Lys He Glu Glu He Met 

805 810 815 

Pro Asn Gly Asp Pro Leu Leu Ala Gly Gin Asn Thr Val Asp Glu Val 

820 825 830 

Tyr Val Ser Arg Pro Ser His Phe Tyr Thr Asp Gly Pro Leu Met Pro 

835 840 845 

Ser Asp Phe 

850 
<210> 470 

<211> 335 

<212> PRT 

<213> Homo sapiens 



<400> 470 

Met Gly Lys Val Lys Val Gly Val 
1 5 
Leu Val Thr Arg Ala Ala Phe Asn 
20 

He Asn Asp Pro Phe He Asp Leu 

35 40 
Tyr Asp Ser Thr His Gly Lys Phe 

50 55 
Gly Lys Leu Val He Asn Gly Asn 
65 70 



Asn Gly Phe Gly Arg He Gly Arg 

10 15 
Ser Gly Lys Val Asp He Val Ala 
25 30 
Asn Tyr Met Val Tyr Met Phe Gin 
45 

His Gly Thr Val Lys Ala Glu Asn 
60 

Pro He Thr He Phe Gin Glu Arg 
75 80 
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Asp 


Pro 


Ser 


Lys 


He 


Lys 


Trp Gly 










85 








Glu 


Ser 


Thr 


Gly 
100 


Val 


Phe 


Thr 


Thr 


Gin 


Gly 


Gly Ala 


Lys 


Arg 


Val 


He 






115 










120 


Pro 


Met 
130 


Phe 


Val 


Met 


Gly 


Val 
135 


Asn 


Lys 


He 


He 


Ser 


Asn 


Ala 


Ser 


Cys 


145 










150 




Ala 


Lys 


Val 


He 


His 
165 


Asp 


Asn 


Phe 


Thr 


Val 


His 


Ala 
180 


He 


Thr 


Ala 


Thr 


Gly 


Lys 


Leu 


Trp 


Arg 


Asp 


Gly Arg 






195 










200 


Ala 


Ser 


Thr Gly 


Ala 


Ala 


Lys 


Ala 




210 










215 




Asn 


Gly 


Lys 


Leu 


Thr 


Gly 


Met 


Ala 


225 










230 






Ser 


Val 


Val 


Asp 


Leu 
245 


Thr 


Cys 


Arg 


Asp 


He 


Lys 


Lys 
260 


Val 


Val 


Lys 


Gin 


He 


Leu 


Gly Tyr 


Thr 


Glu 


His 


Gin 






275 










280 


Asp 


Thr 
290 


His 


Ser 


Ser 


Thr 


Phe 
295 


Asp 


Asp 


His 


Phe 


Val 


Lys 


Leu 


He 


Ser 


305 










310 






Ser 


Asn 


Arg 


Val 


Val 
325 


Asp 


Leu 


Met 
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Asp 


Ala 


Gly 


Ala 


Glu 


Tvr 


Val 


Val 




90 










95 




Met 


Glu 


Lys 


Ala 


Gly 


Ala 


His 


Leu 


105 










110 






He 


Ser 


Ala 


Pro 


Ser 


Ala 


Asp 


Ala 










125 






His 


Glu 


Lys 


Tyr 


Asp 


Asn 


Ser 


Leu 








140 










Thr 


Thr 


Asn 


Cvs 


Leu 


Ala 


Pro 


Leu 






155 










160 


Gly 


He 


Val 


Glu 


Glv 


Leu 


Met 


Thr 




170 










175 




Gin 


Lys 


Thr 


Val 


Asp 


Gly 


Pro 


Ser 


185 










190 
j. j \j 






Gly Ala 


Leu 


Gin 


Asn 


He 


He 


Pro 










205 








Val 


Gly 


Lvs 


Val 


He 


Pro 


Glu 


Leu 








220 










Phe 


Arg 


Val 


Pro 


Thr 


Ala 


Asn 


Val 






235 










** \J 


Leu 


Glu 


Lvs 


Pro 


Ala 


LVS 


Tvr 


Asp 




250 










^ -J ~> 




Ala 


Ser 


Glu 


Glv 


Pro 


Leu 


Lys 


Gly 


265 










270 






Val 


Val 


Ser 


Ser 


Asp 


Phe 


Asn 


Ser 










285 








Ala Gly 


Ala 


Gly 


He 


Ala 


Leu 


Asn 








300 










Trp 


Tyr 


Asp 


Asn 


Glu 


Phe 


Gly 


Tyr 






315 










320 


Ala 


His 


Met 


Ala 


Ser 


Lys 


Glu 






330 










335 





<220> 




<223> 


NAP4 Probe 


<400> 


471 


tccgcctcag tcgcctcttt eg 


<210> 


472 


<211> 


19 


<212> 


DNA 


<213> 


Artificial Sequence 



<220> 

<223> NAP4 FOR PRIMER 
<400> 472 

teggaaggge tccttcaaa 
<210> 473 



<211> 19 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> NAP4 REV PRIMER 

<400> 473 

caccgttgca gctcttggt 

<210> 474 

<211> 34 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MRLP45 Probe 

<400> 474 

ctcccattcc cctcatgcta taaaaagaac tacc 

<210> 475 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MRLP45 FOR PRIMER 

<400> 475 

ggctgctgga agctttgaag 

<210> 476 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MRLP4 5 REV PRIMER 

<400> 476 

tgagcaggat gggagagaac a 

<210> 477 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> TCF2 Probe 

<400> 477 

caaaagctgg ccatggacgc ct 

<210> 478 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> TCF2 FOR PRIMER 

<400> 478 

gcaggaagga ggaggcattc 

<210> 479 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> TCF2 REV PRIMER 

<400> 479 

caggctgtga gtctggttgg a 

<210> 480 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> ROK1 Probe 

<400> 480 

cagctggctt ccattttcct ggcct 

<210> 481 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> ROK1 FOR PRIMER 
<400> 481 

tggcaaaact gggttcagag a 
<210> 482 
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<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ROK1 REV PRIMER 

<400> 482 

tcggaccttg tgggatgtg 

<210> 483 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRT1 Probe 

<400> 483 

ccgccgccta atatgcaaca ttaggg 

<210> 484 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT1 FOR PRIMER 

<400> 484 

cgagtattcc aaagctggta teg 

<210> 485 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT1 REV PRIMER 

<400> 485 

atcacagaga gatggecett atct 

<210> 486 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> KRT5 Probe 

<400> 486 

ccgccgccta atatgcaaca ttaggg 

<210> 487 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT5 FOR PRIMER 

<400> 487 

cgagtattcc aaagctggta teg 

<210> 488 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT5 REV PRIMER 

<400> 488 

atcacagaga gatggecett atct 

<210> 489 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT8 Probe 

<400> 489 

ccgccgccta atatgcaaca ttaggg 

<210> 490 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT8 FOR PRIMER 



BHC03 1 051 



<400> 490 

cgagtattcc aaagctggta teg 
<210> 491 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT8 REV PRIMER 

<400> 491 

atcacagaga gatggecett atct 

<210> 492 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT9 Probe 

<400> 492 

ccgccgccta atatgeaaca ttaggg 

<210> 493 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT9 FOR PRIMER 

<400> 493 

cgagtattcc aaagctggta teg 

<210> 494 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT9 REV PRIMER 
<400> 494 

atcacagaga gatggecett atct 
<210> 495 



<211> 26 



BHC03 1 051 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRT10-2 Probe 

<400> 495 

ccgccgccta atatgcaaca ttaggg 

<210> 496 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT10-2 FOR PRIMER 

<400> 496 

cgagtattcc aaagctggta teg 

<210> 497 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT10-2 REV PRIMER 

<400> 497 

atcacagaga gatggecett atct 

<210> 498 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT14 Probe 

<400> 498 

ccgccgccta atatgcaaca ttaggg 

<210> 499 

<211> 23 

<212> DNA 



<213> Artificial Sequence 



BHC 03 1 051 



<220> 

<223> KRT14 FOR PRIMER 

<400> 499 

cgagtattcc aaagctggta teg 

<210> 500 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT14 REV PRIMER 

<400> 500 

atcacagaga gatggecett atct 

<210> 501 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT18 Probe 

<400> 501 

ccgccgccta atatgeaaca ttaggg 

<210> 502 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT18 FOR PRIMER 

<400> 502 

cgagtattcc aaagctggta teg 

<210> 503 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT18 REV PRIMER 
<400> 503 

atcacagaga gatggecett atct 
<210> 504 



BHC03 1 051 



<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT19 Probe 

<400> 504 

ccgccgccta atatgcaaca ttaggg 

<210> SOS 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT19 FOR PRIMER 

<400> 505 

cgagtattcc aaagctggta teg 

<210> 506 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT19 REV PRIMER 

<400> 506 

atcacagaga gatggecctt. atct 

<210> 507 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT6a/b Probe 

<400> 507 

ccgccgccta atatgcaaca ttaggg 

<210> 508 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT6a/b FOR PRIMER 

<400> 508 

cgagtattcc aaagctggta teg 

<210> 509 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT6a/b REV PRIMER 

<400> 509 

atcacagaga gatggecett atct 

<210> 510 

<211> 35 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT20 Probe 

<400> 510 

tggcgggaat cctatttatc agactctgta attga 

<210> 511 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> KRT20 FOR PRIMER 

<400> 511 

gcaagaaatc agecataaga aagc 

<210> 512 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<400> 512 

ttgcagctcc tctgagtaaa acat 
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